OUTER MEMBRANE PROTEIN OF EHRLICHIA CANIS
AND EHRLICHIA CHAFFEENSIS

This work was supported by grant R01 AI40934 from the National Institutes of Health. The government has
certain rights in this invention.

BACKGROUND OF THE INVENTION 
The ehrlichiae are obligate intracellular bacteria that infect circulating leukocytes. Ehrlichia chaffeensis
infects the monocytes and macrophages in humans and causes human monocytic ehrlichiosis. The clinical
manifestations of ehrlichiosis in humans are nonspecific and similar to those of Rocky Mountain spotted fever.
They include fever, chills, headache, myalgia or vomiting, and weight loss. Most patients have a history of
tick exposure.

Ehrlichia canis infects and causes ehrlichiosis in animals belonging to the family Canidae. Canine
ehrlichiosis consists of an acute and a chronic phase. The acute phase is characterized by fever, serous nasal and
ocular discharges, anorexia, depression, and loss of weight. The chronic phase is characterized by severe
pancytopenia, epistaxis, hematuria, and blood in feces in addition to more severe clinical signs of the acute disease.
If treated early during the course of the disease, dogs respond well to doxycycline. However, chronically infected
dogs do not respond well to the antibiotic. Therefore, early diagnosis is very important for treating canine
ehrlichiosis.

The primary diagnostic test for canine ehrlichiosis and human ehrlichiosis is the indirect
fluorescent antibody (IFA) test. This test uses the etiologic agent Ehrlichia canis to diagnose canine ehrlichiosis.
The IFA test uses Ehrlichia chaffeensis as antigen for diagnosing human ehrlichiosis. The IFA test has, however,
serious limitations. The IFA test is subject to false positives because the antigens are made of whole infected cells,
which comprise many nonspecific proteins that cross-react with sera from some patients. The IFA test is also
subject to false negatives because IFA antigens are unstable and may become inactivated during storage. In
addition, the IFA test requires special equipment. For example, the IFA test requires a tissue
culture system for growing the bacteria that are used to prepare the antigen slides, a fluorescent microscope, and
trained persons to evaluate the serum reactivity to the bacterial antigen on the slide.

Tools which permit simpler, more rapid, and objective serodiagnosis of canine ehrlichiosis or human
ehrlichiosis are desirable.

SUMMARY OF THE INVENTION 

The present invention relates to improved diagnostic tools for veterinary and human use which are used for
serodiagnosing ehrlichiosis in mammals, particularly in members of the Canidae family and in humans.

The present invention also provides polynucleotides or nucleic acids which encode the outer membrane
proteins of E. chaffeensis. The OMP-1 polynucleotide encodes an OMP-1 protein of E. chaffeensis having a
molecular weight of about 27.7 kDa and an amino acid sequence which is at least 85% homologous to the amino
acid sequence shown in FIG. 3B, SEQ ID NO: _. The OMP-1B polynucleotide encodes an OMP-1B protein of E.
chaffeensis having a molecular weight of about 28.2 kDa and an amino acid sequence which is at least 85%
homologous to the amino acid sequence shown in FIG. 4B, SEQ ID NO: _. The OMP-1C polynucleotide encodes
an OMP-1C protein of E. chaffeensis having a molecular weight of about 27.6 kDa and an amino acid sequence
which is at least 85% homologous to the amino acid sequence shown in FIG. 5B, SEQ ID NO: _. The OMP-1D
polynucleotide encodes an OMP-1D protein of E. chaffeensis having a molecular weight of about 28.7 kDa and an
amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 6B, SEQ ID NO:
_. The OMP-1E polynucleotide encodes an OMP-1E protein of E. chaffeensis having a molecular weight of about
27.8 kDa and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG.
7B, SEQ ID NO: _. The OMP-1F polynucleotide encodes an OMP-1F protein of E. chaffeensis having a molecular
weight of about 27.9 kDa and an amino acid sequence which is at least 85% homologous to the amino acid
sequence shown in FIG. 8B, SEQ ID NO: _. The OMP-1A polynucleotide encodes an OMP-1A protein of E.
chaffeensis having a molecular weight of about 29.6 kDa and an amino acid sequence which is at least 85%
homologous to the amino acid sequence shown in FIG. 9B, SEQ ID NO: _. The OMP-1R polynucleotide encodes
an OMP-1R protein of E. chaffeensis having a molecular weight of at least 23 kDa and comprising an amino acid
sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 10B, SEQ ID NO: _. The
OMP-1S polynucleotide encodes an OMP-1S protein of E. chaffeensis comprising an amino acid sequence which is
at least 85% homologous to the amino acid sequence shown in FIG. 11B, SEQ ID NO: _. The OMP-1T
polynucleotide encodes an OMP-1T protein of E. chaffeensis comprising an amino acid sequence which is at least
85% homologous to the amino acid sequence shown in FIG. 12B, SEQ ID NO: _. The OMP-1U polynucleotide
encodes an OMP-1U protein of E. chaffeensis having a molecular weight of about 30.6 kDa and an amino acid
sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 13B, SEQ ID NO: _. The
OMP-1V polynucleotide encodes an OMP-1V protein of E. chaffeensis having a molecular weight of about 28.0 kDa
and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 14B, SEQ
ID NO: _. The OMP-1W polynucleotide encodes an OMP-1W protein of E. chaffeensis having a molecular weight
of about 28.8 kDa and an amino acid sequence which is at least 85% homologous to the amino acid sequence
shown in FIG. 15B, SEQ ID NO: _. The OMP-1X polynucleotide encodes an OMP-1X protein of E. chaffeensis
having a molecular weight of about 27.8 kDa and an amino acid sequence which is at least 85% homologous to the
amino acid sequence shown in FIG. 16B, SEQ ID NO: _. The OMP-1Y polynucleotide encodes an OMP-1Y
protein of E. chaffeensis having a molecular weight of about 28.8 kDa and an amino acid sequence which is at least
85% homologous to the amino acid sequence shown in FIG. 17B, SEQ ID NO: _. The OMP-1Z polynucleotide
encodes an OMP-1Z protein of E. chaffeensis comprising an amino acid sequence which is at least 85% homologous
to the amino acid sequence shown in FIG. 18B, SEQ ID NO: _.

The outer membrane proteins from E. chaffeensis, particularly a recombinant form of OMP-1, are
immunogenic and, thus, are useful for preparing antibodies. Such antibodies are useful for immunolabeling isolates
of E. chaffeensis and for detecting the presence of E. chaffeensis in body fluids, tissues, and particularly in
monocytes and macrophages. The isolated outer membrane proteins, particularly OMP-1, are also useful for
detecting antibodies to E. chaffeensis in the blood of patients with clinical signs of ehrlichiosis. The isolated outer
membrane proteins, particularly OMP-1, are also useful immunogens for raising antibodies that are capable of
reducing the level of infection in an immunized mammal that has been infected with E. chaffeensis. The isolated
membrane proteins are also useful in a vaccine for protecting against infection with E. chaffeensis.

The present invention also relates to isolated polynucleotides which encode 30 kDa outer membrane
proteins from Ehrlichia canis. The proteins are designated P30 and P30a. The proteins, particularly P30, are
immunogenic and are, thus, useful for preparing antibodies that are useful for immunolabeling isolates of E. canis.
The P30 protein is also useful for diagnosing canine ehrlichiosis in mammals, particularly in members of the family
Canidae, most particularly in dogs, and for diagnosing infections with E. chaffeensis in humans. The P30 protein is
also a useful immunogen for raising antibodies that reduce the level of infection in an immunized mammal that has
been infected with E. canis. The P30 protein is also useful in a vaccine for protecting animals against infection with
E. canis.

The present invention also provides the following isolated proteins of E. chaffeensis: OMP-1 (also known as
p28), OMP-1A, OMP-1B, OMP-1C, OMP-1D, OMP-1E, OMP-1F, OMP-1R, OMP-1S, OMP-1T, OMP-1U,
OMP-1V, OMP-1W, OMP-1X, and OMP-1Z, referred to hereinafter collectively as the "OMP family". The present
invention also provides the following isolated proteins of E. canis: P30, P30-a, P30-1, P30-2, P30-3, P30-4, P30-5,
P30-6, P30-7, P30-8, P30-9, and P30-10, referred to hereinafter as the "P30 family".

The present invention also relates to an assay for diagnosing ehrlichiosis in humans using a recombinant
outer membrane protein of E. chaffeensis, particularly OMP-1. The present invention also relates to an assay for
diagnosing ehrlichiosis in humans and members of the family Canidae using a recombinant outer membrane protein
of E. canis, particularly P30.

Brief Description of the Figures 

FIG. 1 shows the DNA sequence of and the amino acid sequence encoded by the E. chaffeensis (p28) gene
cloned in pCREp28. The N-terminal amino acid sequence of native OMP-1 protein (P28) determined chemically is
underlined. Five amino acid residues at the N terminus of P28, which were not included in the p28 gene, are
indicated by boldface. Arrows indicate annealing positions of the primer pair designed for PCR.

FIG. 2 shows the restriction map of 63-kb genomic DNA including the omp-1 gene copies in E.
chaffeensis. The four DNA fragments were cloned from the genomic DNA (pPS2.6, pPS3.6, pEC2.6, and pEC3.6).
A recombinant plasmid pPS2.6 has an overlapping sequence with that of pEC3.6. The closed boxes at the bottom
show PCR-amplified fragments from the genomic DNA for confirmation of the overlapping area. Open boxes at
the top indicate open reading frames (ORF) of omp-1 gene copies, with direction shown by arrows. Open boxes at the
bottom show DNA fragments subcloned for DNA sequencing.

FIG. 3B shows one embodiment of the OMP-1 protein; FIG. 3A shows one embodiment of the OMP-1 
polynucleotide. 

FIG. 4B shows one embodiment of the OMP-1B protein; FIG. 4A shows one embodiment of the OMP-1B
polynucleotide.

FIG. 5A shows one embodiment of the OMP-1C polynucleotide; FIG. 5B shows one embodiment of the
OMP-1C protein.

FIG. 6B shows one embodiment of the OMP-1D protein; FIG. 6A shows one embodiment of the OMP-1D
polynucleotide.

FIG. 7B shows one embodiment of the OMP-1E protein; FIG. 7A shows one embodiment of the OMP-1E
polynucleotide.

FIG. 8B shows one embodiment of the OMP-1F protein; FIG. 8A shows one embodiment of the OMP-1F
polynucleotide.

FIG. 9B shows one embodiment of the OMP-1A protein; FIG. 9A shows one embodiment of the OMP-1A
polynucleotide.

FIG. 10B shows one embodiment of a portion of the OMP-1R protein; FIG. 10A shows one embodiment of
an OMP-1R polynucleotide encoding such polypeptide.

FIG. 11B shows one embodiment of a portion of the OMP-1S protein; FIG. 11A shows one embodiment of
the OMP-1S polynucleotide encoding such polypeptide.

FIG. 12B shows one embodiment of a portion of the OMP-1T protein; FIG. 12A shows one embodiment of
the OMP-1T polynucleotide encoding such polypeptide.

FIG. 13B shows one embodiment of the OMP-1U protein; FIG. 13A shows one embodiment of the OMP-1U
polynucleotide.

FIG. 14B shows one embodiment of the OMP-1V protein; FIG. 14A shows one embodiment of the OMP-1V
polynucleotide.

FIG. 15B shows one embodiment of the OMP-1W protein; FIG. 15A shows one embodiment of the OMP-1W
polynucleotide.

FIG. 16B shows one embodiment of the OMP-1X protein; FIG. 16A shows one embodiment of the OMP-1X
polynucleotide.

FIG. 17B shows one embodiment of the OMP-1Y protein; FIG. 17A shows one embodiment of the OMP-1Y
polynucleotide.

FIG. 18B shows one embodiment of the OMP-1Z protein; FIG. 18A shows one embodiment of the OMP-1Z
polynucleotide.

FIG. 19B shows one embodiment of the P30 protein; FIG. 19A shows one embodiment of the p30
polynucleotide.

FIG. 20B shows one embodiment of the P30a protein; FIG. 20A shows one embodiment of the p30A
polynucleotide.

FIG. 21B shows one embodiment of the P30-1 protein; FIG. 21A shows one embodiment of the p30-1
polynucleotide.

FIG. 22B shows one embodiment of the P30-2 protein; FIG. 22A shows one embodiment of the p30-2
polynucleotide.

FIG. 23B shows one embodiment of the P30-3 protein; FIG. 23A shows one embodiment of the p30-3
polynucleotide.

FIG. 24B shows one embodiment of the P30-4 protein; FIG. 24A shows one embodiment of the p30-4
polynucleotide.

FIG. 25B shows one embodiment of the P30-5 protein; FIG. 25A shows one embodiment of the p30-5
polynucleotide.

FIG. 26B shows one embodiment of the P30-6 protein; FIG. 26A shows one embodiment of the p30-6
polynucleotide.

FIG. 27B shows one embodiment of the P30-7 protein; FIG. 27A shows one embodiment of the p30-7
polynucleotide.

FIG. 28B shows one embodiment of the P30-8 protein; FIG. 28A shows one embodiment of the p30-8
polynucleotide.

FIG. 29B shows one embodiment of a portion of the P30-9 protein; FIG. 29A shows one embodiment of
the p30-9 polynucleotide encoding such polypeptide.

FIG. 30B shows one embodiment of a portion of the P30-10 protein; FIG. 30A shows one embodiment of
the p30-10 polynucleotide encoding such polypeptide.

FIG. 31 depicts the amino acid sequence alignment of seven E. chaffeensis OMP-1s and Cowdria
ruminantium MAP-1. Aligned positions of identical amino acids with OMP-1F are shown with dots. The sequence
of C. ruminantium MAP-1 is from the report of Van Vliet et al. (1994) Molecular cloning, sequence analysis, and
expression of the gene encoding the immunodominant 32-kilodalton protein of Cowdria ruminantium. Infect.
Immun. 62:1451-1456. Gaps indicated by dashes were introduced for optimal alignment of all proteins. Bars
indicate the semivariable region (SV) and three hypervariable regions (HV1, HV2, and HV3).

DETAILED DESCRIPTION OF THE INVENTION 
Isolated Polynucleotides Encoding OMP-1, OMP-1A, OMP-1B, OMP-1C, OMP-1D, OMP-1F and the OMP from
E. canis

In one aspect, the present invention provides isolated polynucleotides that encode the outer membrane
proteins OMP-1 (or p28), OMP-1B, OMP-1C, OMP-1D, OMP-1E, OMP-1F, OMP-1A, OMP-1R, OMP-1S,
OMP-1T, OMP-1U, OMP-1V, OMP-1W, OMP-1X, OMP-1Y and OMP-1Z from E. chaffeensis and the outer membrane
proteins P30, P30-a, P30-1, P30-2, P30-3, P30-4, P30-5, P30-6, P30-7, P30-8, P30-9, and P30-10 from E. canis, or
an immunogenic fragment thereof.

The polynucleotide is single stranded or double stranded. The polynucleotide may be a DNA or RNA
molecule, preferably a DNA molecule, and comprises a sequence which codes for the respective outer membrane
protein. Preferably, the polynucleotide encodes at least the mature form of the outer membrane protein. The
polynucleotide optionally further comprises a leader sequence and encodes an outer membrane preprotein that is
processed in the cell to form the mature protein. The polynucleotide of the present invention may also be fused in
frame to a marker sequence which allows for purification of the corresponding outer membrane protein.

The OMP-1 polynucleotide encodes an OMP-1 protein of E. chaffeensis having a molecular weight of about
27.7 kDa and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG.
3B, SEQ ID NO: _; Figure 3B shows one embodiment of the OMP-1 protein, Figure 3A shows one embodiment of
the OMP-1 polynucleotide. The OMP-1B polynucleotide encodes an OMP-1B protein of E. chaffeensis having a
molecular weight of about 28.2 kDa and an amino acid sequence which is at least 85% homologous to the amino
acid sequence shown in FIG. 4B, SEQ ID NO: _; Figure 4B shows one embodiment of the OMP-1B protein, Figure
4A shows one embodiment of the OMP-1B polynucleotide. The OMP-1C polynucleotide encodes an OMP-1C
protein of E. chaffeensis having a molecular weight of about 27.6 kDa and an amino acid sequence which is at least
85% homologous to the amino acid sequence shown in FIG. 5B, SEQ ID NO: _; Figure 5B shows one embodiment
of the OMP-1C protein, Figure 5A shows one embodiment of the OMP-1C polynucleotide. The OMP-1D
polynucleotide encodes an OMP-1D protein of E. chaffeensis having a molecular weight of about 28.7 kDa and an
amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 6B, SEQ ID NO:
_; Figure 6B shows one embodiment of the OMP-1D protein, Figure 6A shows one embodiment of the OMP-1D
polynucleotide. The OMP-1E polynucleotide encodes an OMP-1E protein of E. chaffeensis having a molecular
weight of about 27.8 kDa and an amino acid sequence which is at least 85% homologous to the amino acid
sequence shown in FIG. 7B, SEQ ID NO: _; Figure 7B shows one embodiment of the OMP-1E protein, Figure 7A
shows one embodiment of the OMP-1E polynucleotide. The OMP-1F polynucleotide encodes an OMP-1F protein
of E. chaffeensis having a molecular weight of about 27.9 kDa and an amino acid sequence which is at least 85%
homologous to the amino acid sequence shown in FIG. 8B, SEQ ID NO: _; Figure 8B shows one embodiment of
the OMP-1F protein, Figure 8A shows one embodiment of the OMP-1F polynucleotide. The OMP-1A
polynucleotide encodes an OMP-1A protein of E. chaffeensis having a molecular weight of about 29.6 kDa and an
amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 9B, SEQ ID NO:
_; Figure 9B shows one embodiment of the OMP-1A protein, Figure 9A shows one embodiment of the OMP-1A
polynucleotide. The OMP-1R polynucleotide encodes an OMP-1R protein of E. chaffeensis having a molecular
weight of at least 23 kDa and comprising an amino acid sequence which is at least 85% homologous to the amino
acid sequence shown in FIG. 10B, SEQ ID NO: _; Figure 10B shows one embodiment of a portion of the OMP-1R
protein, Figure 10A shows one embodiment of the OMP-1R polynucleotide encoding such polypeptide. The
OMP-1S polynucleotide encodes an OMP-1S protein of E. chaffeensis comprising an amino acid sequence which is
at least 85% homologous to the amino acid sequence shown in FIG. 11B, SEQ ID NO: _; Figure 11B shows one
embodiment of a portion of the OMP-1S protein, Figure 11A shows one embodiment of the OMP-1S
polynucleotide encoding such polypeptide. The OMP-1T polynucleotide encodes an OMP-1T protein of E.
chaffeensis comprising an amino acid sequence which is at least 85% homologous to the amino acid sequence shown
in FIG. 12B, SEQ ID NO: _; Figure 12B shows one embodiment of a portion of the OMP-1T protein, Figure 12A
shows one embodiment of a polynucleotide encoding such polypeptide. The OMP-1U polynucleotide encodes an
OMP-1U protein of E. chaffeensis having a molecular weight of about 30.6 kDa and an amino acid sequence which
is at least 85% homologous to the amino acid sequence shown in FIG. 13B, SEQ ID NO: _; Figure 13B shows one
embodiment of the OMP-1U protein, Figure 13A shows one embodiment of the OMP-1U polynucleotide. The
OMP-1V polynucleotide encodes an OMP-1V protein of E. chaffeensis having a molecular weight of about 28.0 kDa
and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 14B, SEQ
ID NO: _; Figure 14B shows one embodiment of the OMP-1V protein, Figure 14A shows one embodiment of the
OMP-1V polynucleotide. The OMP-1W polynucleotide encodes an OMP-1W protein of E. chaffeensis having a
molecular weight of about 28.8 kDa and an amino acid sequence which is at least 85% homologous to the amino
acid sequence shown in FIG. 15B, SEQ ID NO: _; Figure 15B shows one embodiment of the OMP-1W protein,
Figure 15A shows one embodiment of the OMP-1W polynucleotide. The OMP-1X polynucleotide encodes an
OMP-1X protein of E. chaffeensis having a molecular weight of about 27.8 kDa and an amino acid sequence which
is at least 85% homologous to the amino acid sequence shown in FIG. 16B, SEQ ID NO: _; Figure 16B shows one
embodiment of the OMP-1X protein, Figure 16A shows one embodiment of the OMP-1X polynucleotide. The
OMP-1Y polynucleotide encodes an OMP-1Y protein of E. chaffeensis having a molecular weight of about 28.8 kDa
and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 17B, SEQ
ID NO: _; Figure 17B shows one embodiment of the OMP-1Y protein, Figure 17A shows one embodiment of the
OMP-1Y polynucleotide. The OMP-1Z polynucleotide encodes an OMP-1Z protein of E. chaffeensis comprising an
amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 18B, SEQ ID NO:
_; Figure 18B shows one embodiment of a portion of the OMP-1Z protein, Figure 18A shows one embodiment of
an OMP-1Z polynucleotide encoding such polypeptide.

The p30 polynucleotide encodes a P30 protein of E. canis having a molecular weight of about 28.8 kDa
and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 19B, SEQ
ID NO: _; Figure 19B shows one embodiment of the P30 protein, Figure 19A shows one embodiment of the p30
polynucleotide. The p30A polynucleotide encodes a P30a protein of E. canis having a molecular weight of about
29.1 kDa and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG.
20B, SEQ ID NO: _; Figure 20B shows one embodiment of the P30a protein, Figure 20A shows one embodiment
of the p30A polynucleotide. The p30-1 polynucleotide encodes a P30-1 protein of E. canis having a molecular
weight of about 27.7 kDa and an amino acid sequence which is at least 85% homologous to the amino acid
sequence shown in FIG. 21B, SEQ ID NO: _; Figure 21B shows one embodiment of the P30-1 protein, Figure 21A
shows one embodiment of the p30-1 polynucleotide. The p30-2 polynucleotide encodes a P30-2 protein of E. canis
having a molecular weight of about 28.0 kDa and an amino acid sequence which is at least 85% homologous to the
amino acid sequence shown in FIG. 22B, SEQ ID NO: _; Figure 22B shows one embodiment of the P30-2 protein,
Figure 22A shows one embodiment of the p30-2 polynucleotide. The p30-3 polynucleotide encodes a P30-3 protein
of E. canis having a molecular weight of about 28.7 kDa and an amino acid sequence which is at least 85%
homologous to the amino acid sequence shown in FIG. 23B, SEQ ID NO: _; Figure 23B shows one embodiment of
the P30-3 protein, Figure 23A shows one embodiment of the p30-3 polynucleotide. The p30-4 polynucleotide
encodes a P30-4 protein of E. canis having a molecular weight of about 28.0 kDa and an amino acid sequence
which is at least 85% homologous to the amino acid sequence shown in FIG. 24B, SEQ ID NO: _; Figure 24B
shows one embodiment of the P30-4 protein, Figure 24A shows one embodiment of the p30-4 polynucleotide. The
p30-5 polynucleotide encodes a P30-5 protein of E. canis having a molecular weight of about 29.4 kDa and an
amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG. 25B, SEQ ID NO:
_; Figure 25B shows one embodiment of the P30-5 protein, Figure 25A shows one embodiment of the p30-5
polynucleotide. The p30-6 polynucleotide encodes a P30-6 protein of E. canis having a molecular weight of about
29.5 kDa and an amino acid sequence which is at least 85% homologous to the amino acid sequence shown in FIG.
26B, SEQ ID NO: _; Figure 26B shows one embodiment of the P30-6 protein, Figure 26A shows one embodiment
of the p30-6 polynucleotide. The p30-7 polynucleotide encodes a P30-7 protein of E. canis having a molecular
weight of about 29.9 kDa and an amino acid sequence which is at least 85% homologous to the amino acid
sequence shown in FIG. 27B, SEQ ID NO: _; Figure 27B shows one embodiment of the P30-7 protein, Figure 27A
shows one embodiment of the p30-7 polynucleotide. The p30-8 polynucleotide encodes a P30-8 protein of E. canis
having a molecular weight of about 30 J kDa and an amino acid sequence which is at least 85% homologous to the
amino acid sequence shown in FIG. 28B, SEQ ID NO: _; Figure 28B shows one embodiment of the P30-8 protein,
Figure 28A shows one embodiment of the p30-8 polynucleotide. The p30-9 polynucleotide encodes a P30-9 protein
of E. canis comprising an amino acid sequence which is at least 85% homologous to the amino acid sequence shown
in FIG. 29B, SEQ ID NO: _; Figure 29B shows one embodiment of a portion of the P30-9 protein, Figure 29A
shows one embodiment of the p30-9 polynucleotide encoding such polypeptide. The p30-10 polynucleotide encodes
a P30-10 protein of E. canis comprising an amino acid sequence which is at least 85% homologous to the amino
acid sequence shown in FIG. 30B, SEQ ID NO: _; Figure 30B shows one embodiment of a portion of the P30-10
protein, Figure 30A shows one embodiment of the p30-10 polynucleotide encoding such polypeptide.

The polynucleotides encoding an E. chaffeensis outer membrane protein or an E. canis outer membrane
protein have a sequence that is at least 85%, preferably at least 90%, more preferably at least 95% homologous to or
similar to the amino acid sequences shown in Figures 3B through 30B, and thus embrace polynucleotides encoding
outer membrane proteins from different strains of E. chaffeensis and E. canis. The polynucleotides encode an outer
membrane protein whose conserved regions collectively are at least 90%, preferably at least 95%, more preferably at
least 97% homologous to the conserved regions of the amino acid sequences of the present invention. The outer
membrane proteins of E. chaffeensis and E. canis have six conserved regions, which are separated by one
semivariable region and three hypervariable regions. The conserved regions of the outer membrane proteins
OMP-1, OMP-1A, OMP-1B, OMP-1C, OMP-1D, and OMP-1F are depicted in Fig. 31. Preferably, the amino acid sequence
of the outer membrane proteins of E. chaffeensis and E. canis is at least 30% divergent from the amino acid
sequence of MAP-1. Such sequences include allelic, strain variants and other amino acid sequence variants (e.g.,
including "muteins" or "mutant proteins"), whether naturally-occurring or biosynthetically produced. As used
herein, "amino acid sequence homology" is understood to mean amino acid sequence similarity, and homologous
sequences share identical or similar amino acids, where similar amino acids are conserved amino acids as defined by
WO 99/13720 PCT/US98/19600
Dayhoff et al., Atlas of Protein Sequence and Structure, vol. 5, Supp. 3, pp. 345-362 (M. O. Dayhoff, ed., Nat'l
BioMed. Research Fdn., Washington, D.C. 1978). Thus, a candidate sequence sharing 85% amino acid sequence
homology with a reference sequence requires that, following alignment of the candidate sequence with the reference
sequence, 85% of the amino acids in the candidate sequence are identical to the corresponding amino acid in the
reference sequence, or constitute a conserved amino acid change thereto. "Amino acid sequence identity" is
understood to require identical amino acids between two aligned sequences. Thus, a candidate sequence sharing
85% amino acid identity with a reference sequence requires that, following alignment of the candidate sequence
with the reference sequence, 85% of the amino acids in the candidate sequence are identical to the corresponding
amino acid in the reference sequence.

As used herein, all homologies and identities are calculated using the amino acid sequences shown in the
cited Figure or SEQ ID NO as the reference sequence. Thus, to determine whether an amino acid sequence is 85%
homologous to OMP-1, one uses the amino acid sequence shown in Fig. _, SEQ ID NO: _ as a reference.

Also as used herein, sequences are aligned for homology and identity calculations using the method of the
software basic local alignment search tool in the BLAST network service (the National Center for Biotechnology
Information, Bethesda, MD), which employs the method of Altschul, S. F., Gish, W., Miller, W., Myers, E. W. &
Lipman, D. J. (1990) J. Mol. Biol. 215, 403-410. Identities are calculated by the Align program (DNAstar, Inc.). In
all cases, internal gaps and amino acid insertions in the candidate sequence as aligned are ignored when making the
homology/identity calculation.
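
The distinction between identity and homology drawn above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (for example by BLAST) and are supplied as equal-length strings with "-" marking gaps. The substitution groups in CONSERVED_GROUPS are a simplified stand-in chosen for illustration only; they are not the Dayhoff table cited above. The function name and the choice to skip any column containing a gap in either sequence are likewise assumptions of this sketch.

```python
# Illustrative conservative-substitution groups. These are an assumption of
# this sketch and are NOT the Dayhoff et al. table relied on in the text.
CONSERVED_GROUPS = [
    set("AGPST"), set("DENQ"), set("HKR"), set("ILMV"), set("FWY"), set("C"),
]


def _is_conserved_change(a: str, b: str) -> bool:
    """True if residues a and b fall in the same illustrative group."""
    return any(a in group and b in group for group in CONSERVED_GROUPS)


def percent_identity_and_homology(candidate: str, reference: str, gap: str = "-"):
    """Return (percent identity, percent homology) for two aligned sequences.

    Columns containing a gap in either sequence are skipped, as one reading
    of "internal gaps and insertions are ignored".
    """
    if len(candidate) != len(reference):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for c, r in zip(candidate.upper(), reference.upper()):
        if c == gap or r == gap:
            continue  # gap column: not counted
        compared += 1
        if c == r:
            identical += 1
            similar += 1  # identical residues also count toward homology
        elif _is_conserved_change(c, r):
            similar += 1
    if compared == 0:
        raise ValueError("no aligned residues to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared
```

Under these assumptions, a candidate would meet the 85% homology threshold when the second returned value is at least 85.0, and the 85% identity threshold when the first is.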

In another aspect, the present invention provides a nucleotide sequence encoding a polypeptide which
comprises a fragment of the OMP-1 protein, hereinafter referred to as "rP28". The rP28 polypeptide has a molecular
weight of approximately 31 kDa and comprises all but the first 5 amino acids of mature OMP-1 protein. The rP28
polypeptide comprises the amino acid sequence extending from amino acid 6 through amino acid 251 of the amino
acid sequence shown in Fig. 1, SEQ ID NO: _. The present invention also embraces polypeptides where one or more
of the amino acids in the sequence extending from amino acid 1 or 6 through amino acid 251 of Fig. 1 are replaced by
conservative amino acid residues. The present invention also relates to derivatives of rP28 that have an amino acid
sequence identity of at least 85%, more preferably at least 90%, and most preferably of at least 95% with the amino
acid sequence extending from amino acid 1 or 6 through amino acid 251 of the protein and which derivative binds
to antibodies in sera from humans infected with E. chaffeensis.
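
The residue numbering used above (amino acid 6 through amino acid 251) is 1-based and inclusive. A minimal sketch of how such a fragment could be taken from a protein sequence held as a string follows. The function name is hypothetical, and the sequence used is a placeholder of the right length, not the sequence of Fig. 1.

```python
def extract_fragment(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a protein sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("residue range lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


# Placeholder 251-residue string; NOT the sequence shown in Fig. 1.
placeholder = "X" * 5 + "A" * 246

# Drop the first 5 residues, keeping residues 6 through 251 (246 residues).
fragment = extract_fragment(placeholder, 6, 251)
```

With a 251-residue input, the fragment from residue 6 through residue 251 contains 246 residues.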

The polynucleotides are useful for producing the outer membrane proteins of E. chaffeensis and E. canis.
For example, an RNA molecule encoding the outer membrane protein OMP-1 is used in a cell-free translation
system to prepare OMP-1. Alternatively, a DNA molecule encoding the outer membrane protein is introduced into
an expression vector and used to transform cells. Suitable expression vectors include, for example, chromosomal,
nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40, bacterial plasmids, phage DNAs, yeast
plasmids, vectors derived from combinations of plasmids and phage DNAs, and viral DNA such as vaccinia, adenovirus,
fowl pox virus, and pseudorabies. The DNA sequence is introduced into the expression vector by conventional
procedures.

Accordingly, the present invention also relates to recombinant constructs comprising one or more of the
polynucleotide sequences. Suitable constructs include, for example, vectors, such as a plasmid, phagemid, or viral
vector, into which a sequence that encodes the outer membrane protein has been inserted. In the expression vector,
the DNA sequence which encodes the outer membrane protein is operatively linked to an expression control
sequence, i.e., a promoter, which directs mRNA synthesis. Representative examples of such promoters include the
LTR or SV40 promoter, the E. coli lac or trp, the phage lambda PL promoter and other promoters known to control
expression of genes in prokaryotic or eukaryotic cells or in viruses. The promoter may also be the natural promoter
of the outer membrane protein coding sequence. The expression vector also contains a ribosome binding site for
translation initiation and a transcription terminator. Preferably, the recombinant expression vectors also include an
origin of replication and a selectable marker, such as, for example, the ampicillin resistance gene of E. coli, to permit
selection of transformed cells, i.e., cells that are expressing the heterologous DNA sequences. The polynucleotide
sequence encoding the outer membrane protein is incorporated into the vector in frame with translation initiation
and termination sequences. Optionally, the sequence encodes a fusion outer membrane protein which includes an
N-terminal or C-terminal peptide or tag that stabilizes or simplifies purification of the expressed recombinant
product. Representative examples of such tags include sequences which encode a series of histidine residues, the
Herpes simplex glycoprotein D, or glutathione S-transferase.

Polynucleotides which encode portions of the outer membrane proteins of E. chaffeensis and E. canis are 
useful as probes for isolating and identifying E. chaffeensis genes and E. canis genes, particularly full-length genes 
from new strains or isolates of E. chaffeensis and E. canis. 

The Outer Membrane Proteins of E. chaffeensis and E. canis 

In addition to the outer membrane proteins OMP-1, OMP-1B, OMP-1C, OMP-1D, OMP-1E, and OMP-1F, 
OMP-1R, OMP-1S, OMP-1T, OMP-1U, OMP-1V, OMP-1W, OMP-1X, OMP-1Y, and OMP-1Z from E. 
chaffeensis and the proteins P30, P30A, P30-1, P30-2, P30-3, P30-4, P30-5, P30-6, P30-7, P30-8, P30-9, and P30-10 
from E. canis, the present invention embraces non-naturally occurring allelic forms or derivatives of the outer 
membrane proteins, where one or more of the amino acids have been replaced by conservative amino acid residues, 
typically by using direct synthesis or recombinant techniques. 
Preparing the Outer Membrane Proteins 

The outer membrane proteins of the present invention are synthetically produced by conventional peptide 
synthesizers. The outer membrane proteins are also produced using cell-free translation systems and RNA 
molecules derived from DNA constructs that encode the outer membrane protein. Alternatively, the outer 
membrane protein is made by transfecting host cells with expression vectors that comprise a DNA sequence which 
encodes the outer membrane protein and then inducing expression of the outer membrane protein in the host cells. 

The outer membrane protein is expressed in suitable host cells, preferably bacteria, under the control of 
suitable promoters. Host cells are transformed with the expression vectors of this invention and cultured in 
conventional nutrient media. Such media optionally contain additional compounds, such as, for example, 
compounds that induce promoters, such as, for example, isopropyl-β-D-thiogalactoside, which induces the Lac 
promoter, or compounds, such as, for example, ampicillin, which allow for selection of transformants. 

Following transformation of the suitable host strain and growth of the host strain to an appropriate cell 
density, the cells are typically harvested by centrifugation and disrupted by physical or chemical means, and the 
resulting crude extract is retained for further purification of the outer membrane protein. Such purification usually 
involves salting-out of the protein fraction and one or more chromatography steps, including aqueous ion exchange 
chromatography, size exclusion chromatography, and high performance liquid chromatography (HPLC). 
Preparation of Antibodies 

The isolated outer membrane proteins, particularly the recombinant forms of the outer membrane proteins, 
are used as immunogens to produce antibodies immunospecific for the corresponding protein. The term 
"immunospecific" means the antibodies have substantially greater affinity for the protein used as an immunogen 
than for other proteins. Such antibodies are generated using conventional techniques by administering the 
respective outer membrane protein or a portion thereof, i.e., the recombinant polypeptide, to an animal, preferably a 
nonhuman, collecting blood from the immunized animals, and isolating the serum and/or the IgG fraction from the 
blood. Monoclonal antibodies are prepared by injecting animals with the immunogens, extracting antibody-producing 
B cells from the animal, fusing the B cells with myeloma cells to produce hybridomas, and obtaining the 
monoclonal antibodies from the hybridomas. 

Antibodies to the outer membrane proteins of E. chaffeensis and E. canis are useful research tools for 
identifying cells, particularly monocytes, infected with E. chaffeensis or E. canis and for purifying the corresponding 
outer membrane protein of E. chaffeensis or E. canis from partially purified preparations by affinity chromatography. 
Such antibodies are also useful for identifying bacterial colonies, particularly colonies of genetically-engineered 
bacteria, that are expressing the major outer membrane protein. 
Diagnostic Method 

The present invention also provides a method for detecting antibodies to E. chaffeensis or E. canis in a 
sample of a bodily fluid from a patient. The method comprises providing an isolated outer membrane protein of E. 
chaffeensis or E. canis, particularly a recombinant form of the isolated protein; contacting the outer membrane 
protein or polypeptide with a sample taken from the patient; and assaying for the formation of a complex between 
the outer membrane protein or polypeptide and antibodies in the sample. For ease of detection, it is preferred that 
the isolated protein or polypeptide be attached to a substrate such as a column, plastic dish, matrix, or membrane, 
preferably nitrocellulose. The sample may be a tissue or a biological fluid, including urine, whole blood, or 
exudate, preferably serum. The sample may be untreated, or subjected to precipitation, fractionation, separation, or 
purification before combining with the isolated protein or peptide. Interactions between antibodies in the sample 
and the isolated protein or peptide are detected by radiometric, colorimetric, or fluorometric means, size-separation, 
or precipitation. Preferably, detection of the antibody-outer membrane protein complex is by addition of a 
secondary antibody that is coupled to a detectable tag, such as, for example, an enzyme, fluorophore, or 
chromophore. Formation of the complex is indicative of the presence of anti-E. chaffeensis or anti-E. canis antibodies, 
either IgM or IgG, in the patient. Thus, the method is used to determine whether a patient is infected with E. 
chaffeensis or E. canis. 

Preferably, the method employs an enzyme-linked immunosorbent assay (ELISA) or a Western 
immunoblot procedure. Such methods are relatively simple to perform and do not require special equipment as 
long as membrane strips are coated with a high quality antigen. Accordingly, it is more advantageous to use a 
recombinant form of the outer membrane protein of E. chaffeensis or E. canis since such proteins, typically, are 
purer and more consistent in quality than a purified form of such protein. 
Immunogenic Composition 

The present invention also relates to immunogenic compositions comprising one or more of the isolated 
outer membrane proteins of E. chaffeensis and a pharmaceutically acceptable adjuvant and to immunogenic 
compositions comprising an isolated P30 protein of E. canis and a pharmaceutically acceptable adjuvant, which, 
preferably, enhances the immunogenic activity of the outer membrane protein in the host animal. 

Preparation of a Polynucleotide which Encodes OMP-1 (P28) 
A. Isolation of the Outer Membrane Proteins 

E. chaffeensis Arkansas strain and E. canis Oklahoma strain were cultivated in the DH82 dog macrophage 
cell line and purified by Percoll density gradient centrifugation. Purified ehrlichiae (100 µg) were suspended in 
10 mM sodium phosphate buffer, pH 7.4, containing 0.1% sodium N-lauroyl sarcosine (Sarkosyl) [Sigma, St. 
Louis, MO], 50 µg/ml each of DNase I (Sigma) and RNase A (Sigma), and 2.5 mM MgCl2. After incubation at 37°C for 
30 min, the sample was separated by centrifugation at 10,000 x g for 1 h into the soluble supernatant and the 
insoluble precipitate. The insoluble pellet was resuspended 2 to 3 times with 0.1% Sarkosyl and centrifuged. The 
final pellet was analyzed by sodium dodecyl sulfate (SDS)-polyacrylamide gel electrophoresis (PAGE) and by 
electron microscopy. 

Transmission electron microscopy revealed that the purified ehrlichial fraction consists of a mixture of 
electron dense and light forms of E. chaffeensis with slight disintegration of the inner membrane. Ehrlichiae were not 
surrounded with the host inclusion membrane. Various sizes of membrane vesicles (< 1 µm) without significant 
ribosomes or nuclear materials were observed in the Sarkosyl-insoluble fraction from the organism. Succinic 
dehydrogenase (inner membrane marker enzyme of gram negative bacteria) activities were at less than the detection 
limit (1 nmole/min/mg of protein) in the Sarkosyl-insoluble fraction compared to approximately 10 nmoles/ 
min/mg of protein in the Percoll-purified organisms, suggesting that the insoluble fraction primarily consisted of 
the outer membrane of E. chaffeensis. 

Analysis of the Sarkosyl-soluble and -insoluble fractions of E. chaffeensis by SDS-PAGE suggested that 
proteins of the 30-kDa range in the insoluble fraction represent the major outer membrane proteins of this organism. 
Analysis of the Sarkosyl-soluble and -insoluble fractions of E. canis by SDS-PAGE suggested that proteins of the 
30-kDa range in the insoluble fraction represent the major outer membrane proteins of this organism also. E. canis was 
antigenically cross-reactive with E. chaffeensis. These findings indicate that the 30-kDa range proteins represent the 
major outer membrane proteins of these two Ehrlichia spp. 

To improve resolution of the outer membrane proteins, proteins in the Sarkosyl-insoluble pellet prepared 
from 400 µg of purified E. chaffeensis were separated by a reversed-discontinuous (Rd) SDS-PAGE (23-cm-long 
17% gel on top of 11-cm-long 12% gel). At least five proteins of the 30-kDa range in E. chaffeensis (P23, P25, P27, 
P28, and P29) were resolved from the Sarkosyl-insoluble proteins. 
B. Cloning and sequencing of the p28 gene 

The portion of the membrane containing bound proteins was excised and analyzed with an Applied 
Biosystems protein sequencer (Model 470). The N-terminal amino acid sequence of P28 was determined as 
DPAGSGINGNFYSGKYMP, SEQ ID NO . Based on the 6th to 12th amino acids of this sequence, a 
forward primer, FECH1, having the sequence: 5'- 
was designed. Amino acids at positions 1 to 5 of the N terminus of P28 were not included in this 
primer design. For insertion into an expression vector, a 14-bp sequence (underlined) was added at the 5' end of the 
primer to create an EcoRI and a BamHI site. The reverse primer, RECH2, which includes a NotI site at the 5' end 
for ligation into an expression vector, had the sequence: 
5'-AGCGGCCGCTTA(A/G)AA(T/C)A(C/G)(A/G)AA(C/T)CTT(C/G)CTCC-3', SEQ ID NO . 
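
The degenerate positions in the reverse primer can be enumerated programmatically. The following Python sketch is illustrative only: the primer string is transcribed from the passage above as best it can be read from the damaged text, and the function name `expand_degenerate` is hypothetical rather than part of the described method.

```python
import itertools
import re


def expand_degenerate(primer):
    """Expand a primer written with (X/Y) alternatives into every concrete sequence."""
    # Tokenise into parenthesised alternatives such as "(A/G)" and fixed runs such as "CTT".
    tokens = re.findall(r"\(([ACGT/]+)\)|([ACGT]+)", primer.replace(" ", ""))
    choices = []
    for alternatives, fixed in tokens:
        choices.append(alternatives.split("/") if alternatives else [fixed])
    return ["".join(parts) for parts in itertools.product(*choices)]


# RECH2 as read from the text (assumption: the OCR damage is resolved as shown here).
RECH2 = "AGCGGCCGCTTA(A/G)AA(T/C)A(C/G)(A/G)AA(C/T)CTT(C/G)CTCC"

variants = expand_degenerate(RECH2)
print(len(variants))                                # six two-fold positions give 2**6 sequences
print(all("GCGGCCGC" in v for v in variants))       # every variant keeps the NotI site
```

Under this reading the primer pool holds 64 distinct 30-mers, each carrying the NotI recognition sequence GCGGCCGC near its 5' end.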

Genomic DNA of E. chaffeensis was isolated from purified organisms. PCR amplification with FECH1 
and RECH2 primers was performed using a Perkin-Elmer Cetus DNA Thermal Cycler (model 480). A 0.8-kb 
amplified product was cloned in the pCRII vector of a TA cloning kit, as described by the manufacturer (Invitrogen 
Co., San Diego, CA). The clone obtained was designated pCRIIp28. Both strands of the inserted DNA were 
sequenced by a dideoxy-termination method with an Applied Biosystems 373A DNA sequencer. 

The 0.8-kb DNA fragment, cloned in pCRIIp28, had an open reading frame (ORF) of 756 bp encoding a 
251-amino acid recombinant protein (including both PCR primer regions) with a molecular mass of 27,685 Da. The 
nucleotide sequence of the open reading frame, SEQ ID NO: , and the amino acid sequence of the polypeptide of 
the OMP-1 protein, SEQ ID NO , are shown in Figs and , respectively. 
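
The relationship between the stated ORF length and protein length can be checked with simple arithmetic. The Python sketch below is a hedged illustration, assuming that the 756-bp figure includes the terminal stop codon; the function name `orf_summary` is hypothetical.

```python
def orf_summary(orf_bp, mass_da):
    """Relate ORF length to residue count, assuming the final codon is a stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    residues = codons - 1  # assumption: the last codon is the stop codon
    return {
        "codons": codons,
        "residues": residues,
        "mean_residue_mass": mass_da / residues,
    }


summary = orf_summary(756, 27685)
print(summary["codons"], summary["residues"])      # 252 codons, 251 residues
print(round(summary["mean_residue_mass"], 1))      # about 110.3 Da per residue
```

The mean of roughly 110 Da per residue is in the range commonly seen for proteins, so the stated length and mass are mutually consistent under this assumption.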

A DNA fragment comprising the p30 gene was prepared in a similar manner, i.e., by PCR amplification of 
genomic DNA of E. canis with the FECH1 and RECH2 primers. 

Preparation of Polynucleotides which Encode OMP-1A, OMP-1B, OMP-1C, OMP-1D, OMP-1F, and OMP-1E 
A. Southern blot analysis. Genomic DNA extracted from the purified E. chaffeensis (200 ng each) was digested 
with restriction endonucleases, electrophoresed, and transferred to Hybond-N+ nylon membrane (Amersham, 
Arlington Heights, IL) by a standard method. The 0.8-kb p28 gene fragment from the clone pCRIIp28 was labeled 
with [α-32P]dATP by the random primer method using a kit (Boehringer Mannheim, Indianapolis, IN), and the 
labeled fragment was used as a DNA probe. Hybridization was performed at 60°C in rapid hybridization buffer 
(Amersham) for 20 h. The nylon sheet was washed in 0.1 x SSC (1 x SSC containing 0.15 M sodium chloride and 
0.015 M sodium citrate) with 1% SDS at 55°C, and the hybridized probes were exposed to Hyperfilm (Amersham) at 
-80°C. 
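
The salt concentrations in the wash buffer follow directly from the stated composition of 1 x SSC. A minimal Python sketch of that dilution arithmetic is given here; the helper name `dilute` is hypothetical.

```python
# Composition of 1 x SSC in mol/L, as stated in the text.
SSC_1X = {"sodium chloride": 0.15, "sodium citrate": 0.015}


def dilute(stock, factor):
    """Scale every component concentration by the dilution factor (0.1 for 0.1 x SSC)."""
    return {name: conc * factor for name, conc in stock.items()}


wash = dilute(SSC_1X, 0.1)
for name, molar in wash.items():
    print(f"{name}: {molar * 1000:.1f} mM")
```

For the 0.1 x SSC wash this gives 15 mM sodium chloride and 1.5 mM sodium citrate, a low-salt (high-stringency) condition.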

Genomic Southern blot analysis with several restriction enzymes resulted in one or more DNA fragment(s) 
of E. chaffeensis which hybridized to the 32P-labeled p28 gene probe. The restriction enzymes used did not cut within 
the p28 gene portion of the pCRIIp28 insert. Xba I, Bgl II, and Kpn I produced two bands, Spe I generated three 
bands, and EcoR V and Pst I produced multiple bands with different densities. EcoR I generated a broad band of 
2.5 to 4 kb. These p28 homologous genes are designated as the omp-1 (outer membrane protein-1) family. 
B. Cloning and sequencing of genomic copies of E. chaffeensis p28 gene. The EcoR I and Pst I fragments of DNA, 
detected by genomic Southern blot analysis as described above, were inserted into pBluescript II KS (+) vectors, 
and the recombinant plasmids were introduced into E. coli DH5α. Using the colony hybridization method with the 
32P-labeled p28 gene probe, four positive clones were isolated from the transformants. The positive clones were 
designated pEC2.6, pEC3.6, pPS2.6, and pPS3.6. These contained the ehrlichial DNA fragments of 2.6 kb (EcoR 
I), 3.6 kb (EcoR I), 2.6 kb (Pst I), and 3.6 kb (Pst I), respectively. The inserts of the clones pEC3.6 and pPS2.6 
overlapped as shown in Fig . . . The overlapping area was further confirmed by PCR of E. chaffeensis genomic 
DNA with two pairs of primer sets interposing the junctions of the four clones. The 1.1- to 1.6-kb DNA fragments 
of Hind III-Hind III, Hind III-EcoR I, or Xho I-EcoR I in the pEC2.6 and pEC3.6 were subcloned for sequencing. DNA 
sequencing was performed with suitable synthetic primers by the dideoxy-termination method as described above. 

Four DNA fragments from 2.6 to 3.6 kb were cloned from the EcoR I-digested and the Pst I-digested 
genomic DNA of E. chaffeensis by colony hybridization with radiolabeled p28 gene probe. The inserted DNA of the 
two recombinant clones, pEC3.6 and pPS2.6, overlapped as shown in Fig. 7. Sequencing revealed one 5'-truncated 
ORF of 243 bp (designated omp-1A) and five complete ORFs of 836-861 bp (designated omp-1B to 
omp-1F), which are tandemly arrayed and are homologous (but not identical) to the p28 gene, in the ehrlichial 
genomic DNA of 6,292 bp. The intergenic spaces were 581 bp between omp-1A and omp-1B and 260-308 bp 
among the others. Putative promoter regions and ribosome-binding sites were identified in the noncoding regions. 

Sequence analysis and GenBank accession number. 

Nucleotide sequences were analyzed with the DNASIS program (Hitachi Software Engineering Co., Ltd., 
Yokohama, Japan). A homology search was carried out with the GenBank, Swiss-Prot, PDB, and PIR databases 
by using the basic local alignment search tool (BLAST) network service of the National Center for 
Biotechnology Information, Bethesda, MD. Phylogenetic analysis was performed by using the PHYLIP software 
package (version 3.5). An evolutionary distance matrix, generated by using the Kimura formula in PROTDIST, 
was used for construction of a phylogenetic tree by using the unweighted pair-group method analysis (UPGMA) 
(Felsenstein, J. 1989. PHYLIP-phylogeny inference package (version 3.2). Cladistics 5:164-166). The data were 
also examined using parsimony analysis (PROTPARS in PHYLIP). A bootstrap analysis was carried out to 
investigate the stability of randomly generated trees by using SEQBOOT and CONSENSE in the same package. 
The nucleotide sequences of the p28 gene and its gene copies have been assigned GenBank accession numbers 
U72291 and AF021338, respectively. 
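
For readers unfamiliar with UPGMA, the clustering step can be sketched in a few lines of Python. This is not the PHYLIP implementation; it is a minimal illustration, and the labels and distances below are invented toy values, not data from this work.

```python
from itertools import combinations


def upgma(names, dist):
    """Build a UPGMA tree as nested (left, right, height) tuples.

    names: list of leaf labels.
    dist:  dict mapping frozenset({a, b}) to the distance between a and b.
    """
    clusters = {name: (name, 1) for name in names}  # label -> (subtree, leaf count)
    d = dict(dist)
    while len(clusters) > 1:
        # Join the two clusters that are currently closest.
        a, b = min(combinations(clusters, 2), key=lambda pair: d[frozenset(pair)])
        (tree_a, n_a), (tree_b, n_b) = clusters.pop(a), clusters.pop(b)
        height = d[frozenset((a, b))] / 2
        merged = f"({a},{b})"
        # Distance to every remaining cluster is the size-weighted average.
        for c in clusters:
            d[frozenset((merged, c))] = (
                n_a * d[frozenset((a, c))] + n_b * d[frozenset((b, c))]
            ) / (n_a + n_b)
        clusters[merged] = ((tree_a, tree_b, height), n_a + n_b)
    tree, _ = next(iter(clusters.values()))
    return tree


# Hypothetical toy distances between four sequences labelled A to D.
toy = {
    frozenset(("A", "B")): 0.2,
    frozenset(("A", "C")): 0.6,
    frozenset(("B", "C")): 0.6,
    frozenset(("A", "D")): 1.0,
    frozenset(("B", "D")): 1.0,
    frozenset(("C", "D")): 1.0,
}
tree = upgma(["A", "B", "C", "D"], toy)
print(tree)
```

In the toy example A and B join first, C joins that pair next, and D attaches last, which mirrors how a distantly related sequence ends up outside the tighter clusters.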
Proteins of the E. chaffeensis omp-1 Family. 

Five complete omp-1 gene copies (omp-1B to omp-1F) encode 279- to 287-amino acid proteins with 
molecular masses of 30,320 - 31,508 Da. Omp-1A encodes an 82-amino acid partial protein (9,243 Da) which lacks 
the N-terminal region. The 25-amino acid sequence at the N-terminus of OMP-1B to OMP-1F (encoded in omp-1B 
to omp-1F) is predicted to be a signal peptide because three carboxyl-terminal amino acids of the signal peptides 
(Ser-X-Ala in OMP-1B, Leu-X-Ser for OMP-1C, and Ser-X-Ser for OMP-1D and OMP-1F) are included in the 
preferred amino acid sequence of signal peptidase at the processing sites proposed by Oliver. The calculated 
molecular masses of the mature OMP-1B to OMP-1F from the predicted amino acid sequences are 28,181 Da for 
OMP-1B, 27,581 Da for OMP-1C, 28,747 Da for OMP-1D, 27,776 Da for OMP-1E, and 27,933 Da for OMP-1F. 
The estimated isoelectric points are 4.76-5.76 in the mature OMP-1B to OMP-1F. An amino acid sequence in the 
omp-1F gene (the 80th to 94th amino acids) was identical to the N-terminal amino acid sequence of E. chaffeensis native 
P23 protein as determined chemically, which indicates that P23 is derived from the omp-1F gene. Amino acid 
sequences identical to the N-terminal sequences of P25, P27, and P29 were not found in those from omp-1 gene 
copies cloned in this study. 
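
The text does not state how the molecular masses were calculated. One common approach, sketched below in Python as an assumption rather than as the method used here, sums standard average residue masses and adds one water molecule for the free termini. The function name `protein_mass` is hypothetical, and the example peptide is the N-terminal sequence of P28 as it can be read from the text.

```python
# Standard average residue masses in Da (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # Da, added once for the N-terminal H and C-terminal OH


def protein_mass(sequence):
    """Average molecular mass of an unmodified polypeptide, in Da."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence.upper()) + WATER


# N-terminal peptide of P28 as read from the text (assumption: OCR reading is correct).
peptide = "DPAGSGINGNFYSGKYMP"
print(f"{protein_mass(peptide):.1f} Da")
```

Applied to a full mature sequence with the signal peptide removed, the same sum would give a figure comparable to the masses listed above, although software packages may differ slightly in the residue masses they use.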

Alignment of predicted amino acid sequences of the E. chaffeensis OMP-1 family and Cowdria 
ruminantium revealed substitutions or deletions of one or several contiguous amino acid residues throughout the 
molecules. The significant differences in sequences among the aligned proteins are seen in the regions indicated SV 
(semivariable region) and HV (hypervariable region) 1 to 3 in Fig 3L. Computer analysis for hydropathy revealed 
that protein molecules predicted from all omp-1 gene copies contain alternating hydrophilic and hydrophobic motifs 
which are characteristic of transmembrane proteins. HV1 and HV2 were found to be located in the hydrophilic 
regions. 

The amino acid sequences of 5 mature proteins without signal peptides (OMP-1C to OMP-1F and P28) 
were similar to one another (71-83%), but the sequence of OMP-1B was dissimilar to those of the 5 proteins 
(45-48%). The amino acid sequences of the 5 proteins showed an intermediate degree of similarity with that of C. 
ruminantium MAP-1 (59-63%), but the similarity between that of OMP-1B and the C. ruminantium MAP-1 was 
low (45%). These relations are shown in a phylogenetic tree which was obtained based on the amino acid sequence 
alignment by the UPGMA method in the PHYLIP software package (Fig. 10). Three proteins (P28, OMP-1D, and 
OMP-1F) and two proteins (OMP-1C and OMP-1E) formed two separate clusters. OMP-1B was located 
distantly from these two clusters. The C. ruminantium MAP-1 was positioned between OMP-1B and the other 
members in the OMP-1 family. 
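
Pairwise percentages of this kind are derived from aligned sequences. The Python sketch below computes percent identity over aligned columns, which is only one simple measure; the similarity values reported here may have been computed differently by the software used. The function name `percent_identity` and the two short aligned strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of compared alignment columns in which both sequences share a residue.

    Columns where both sequences have a gap ('-') are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:  # both gaps were skipped above, so equality means a shared residue
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical aligned fragments, used only to show the calculation.
print(percent_identity("MKYT-GSA", "MKFTAGSA"))  # 6 of 8 columns match
```

The same function could be applied to a candidate sequence and a reference sequence when checking a threshold such as the 85% figure recited in the claims, provided the alignment method is fixed in advance.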
Preparation of a Recombinant Form of OMP-1 and P30 

The 0.8-kb p28 gene was excised from the clone pCRIIp28 by EcoRI-NotI double digestion, ligated into the 
EcoRI-NotI sites of a pET 29a expression vector, and amplified in Escherichia coli BL21(DE3)pLysS (Novagen, 
Inc., Madison, WI). The clone (designated pET29p28) produced a fusion protein with a 35-amino acid sequence 
carried from the vector at the N terminus. The amino acid sequence of the OMP-1 portion of the fusion protein is 
depicted in Fig. . 

An expression vector comprising the p30 gene was used to prepare the recombinant form of P30. 
The following examples are for purposes of illustration only and are not intended to limit the scope of the 
claims which are appended hereto. 

Preparation of anti-rP28 (anti-OMP-1) antibody 

The (r) P28 antigen was prepared by excising the gel band corresponding to the rP28 in SDS-PAGE, 
mincing the band in phosphate-buffered saline (PBS), pH 7.4, and mixing with an equal volume of Freund's 
incomplete adjuvant (Sigma). The rP28 mixture (1 mg of protein each time) was subcutaneously injected into a 
rabbit every 2 weeks four times. A serum sample was collected from the rabbit to provide the anti-rP28 antibody. 

The anti-rP28 antibody was examined by Western immunoblot analysis. The results indicated that the 
rabbit anti-rP28 antibody recognized not only rP28 (31 kDa) and P28, but also P29 and P25 of E. chaffeensis and 
P30 of E. canis. These results indicate that P28 shares antigenic epitopes with P25 and P29 in E. chaffeensis and 
P30 of E. canis. 



Example 1. Assaying for the presence of anti-OMP-1 antibody in a Patient 

Convalescent-phase serum from a patient with clinical signs of human ehrlichiosis was used. Western blot 
analysis using the rP28 protein as antigen was performed with 1:1,000 dilutions of this serum. Alkaline 
phosphatase-conjugated affinity-purified anti-human, anti-rabbit, or anti-mouse immunoglobulin G (Kirkegaard & 
Perry Laboratories, Inc., Gaithersburg, MD) was used at a 1:1,000 or 1:2,000 dilution as the secondary antibody. 
Results indicated that serum from a patient with clinical signs of human ehrlichiosis reacted strongly to rP28 (31 
kDa). 

Example 2. Assaying for the presence of anti-OMP-1 antibody in a Patient 

Convalescent-phase serum from a patient with clinical signs of human ehrlichiosis was reacted with the 
rP30 protein of E. canis as described in Example 1. The serum reacted strongly to rP30. These results indicate that 
rP30 is useful for diagnosing an infection with E. chaffeensis in human patients. 
Example 3. Identifying E. chaffeensis-infected cells using anti-rP28 antibody 

E. chaffeensis-infected DH82 cells were sonicated and centrifuged at 400 x g for 10 min. The supernatant 
was then centrifuged at 10,000 x g for 10 min to obtain an ehrlichia-enriched pellet. The pellet was resuspended and 
incubated with rabbit anti-rP28 antibody or normal rabbit serum (1:100 dilution) at 37°C for 1 h in PBS containing 
1% bovine serum albumin (BSA-PBS). After washing, the ehrlichiae were incubated with gold-conjugated protein G 
(20 nm; Sigma) at 1:30 dilution for 1 h at room temperature in BSA-PBS. After washing again, the specimen was 
fixed with 1.25% formaldehyde, 2.5% glutaraldehyde, and 0.03% trinitrophenol in 0.1 M cacodylate buffer (pH 7.4) 
for 24 h and postfixed in 1% osmium-1.5% potassium ferricyanide for 1 h (34). The section was then embedded in 
PolyBed 812 (Polysciences, Warrington, Pa.). The specimen was ultrathin sectioned at 60 nm, stained with uranyl 
acetate and lead citrate, and observed with a Philips 300 transmission electron microscope at 60 kV. 

Transmission immunoelectron microscopy with colloidal gold-conjugated protein G and rabbit anti-rP28 
antibody revealed gold particles bound to the E. chaffeensis surface. The distribution of the particles was random, close 
to the surface, and appeared as if almost embedded in the membrane, suggesting that the antigenic epitope protrudes 
very little from the lipid bilayer. Nonetheless, the antigenic epitope was surface-exposed and thus could be 
recognized by rabbit anti-rP28 antibody. No gold particles were observed on host cytoplasmic membrane or E. 
chaffeensis incubated with normal rabbit serum. 
Example 4. Immunization of mice and E. chaffeensis challenge. 

The rP28 band in SDS-PAGE was excised, minced, and mixed with an equal volume of Freund's 
incomplete or complete adjuvant. Nine BALB/c male mice (6 weeks old) were divided into two groups. Five mice 
were intraperitoneally immunized a total of four times at 10-day intervals; twice with a mixture of the minced gel 
with the rP28 (30 to 40 µg of protein per mouse each time) and incomplete adjuvant, and twice with a mixture of 
the recombinant protein (the same amount as before) and complete adjuvant. Four mice were intraperitoneally 
injected with a mixture of the minced gel without protein and the respective adjuvants. For ehrlichia challenge, 
approximately 1 x 10^7 DH82 cells heavily infected with E. chaffeensis were disrupted by sonication in serum-free 
DMEM (GIBCO-BRL) and centrifuged at 200 x g for 5 min. The supernatant was diluted to a final volume of 5 ml, 
and 0.3 ml was inoculated intraperitoneally into each mouse 10 days after the last immunization. Before challenge, 
all 5 immunized mice had a titer of 1:160 against E. chaffeensis antigen by IFA and all 4 nonimmunized mice were 
negative. 
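
The size of each challenge inoculum can be estimated from the volumes given. The Python sketch below is an idealised upper bound: it assumes the inoculum volume reads 0.3 ml, that the sonicate is evenly suspended, and that no material is lost in the low-speed centrifugation. The function name `cell_equivalents_per_dose` is hypothetical.

```python
def cell_equivalents_per_dose(total_cells, final_volume_ml, dose_ml):
    """Estimate infected-cell equivalents delivered per inoculum (idealised, no losses)."""
    if final_volume_ml <= 0 or dose_ml < 0:
        raise ValueError("volumes must be positive")
    return total_cells * dose_ml / final_volume_ml


dose = cell_equivalents_per_dose(1e7, 5.0, 0.3)
print(f"{dose:.1e} infected-cell equivalents per mouse")
```

Under these assumptions each mouse received material from about 6 x 10^5 infected cells.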

At day 5 post-challenge, approximately 1 ml of blood was collected in an EDTA tube from each mouse, 
and protection was assessed by PCR detection of E. chaffeensis 16S rDNA in the buffy coat of the collected blood. 
E. chaffeensis could not be reisolated in cell culture at day 10 postinfection. Day 5 post-challenge is the optimum 
time at which establishment of ehrlichial infection can be examined by PCR without the influence of residual DNA 
from the ehrlichiae used as the challenge before the spontaneous clearance of organisms takes place. The E. 
chaffeensis-specific DNA fragment was observed in all nonimmunized mice but not in any immunized mice, 
indicating that immunization with rP28 apparently protects mice from ehrlichial infection and indicating that P28 
is a potential protective antigen. 

Example 5. Assaying for the presence of anti-P30 antibody in Dogs 

The rP30 protein was used as an antigen in a Western immunoblot analysis and dot blot analysis to detect 
the presence of antibody to E. canis in serum from E. canis-infected dogs. The results of the Western immunoblot 
analysis indicated that reactivity of the sera with rP30 was stronger than the reactivity that was observed when 
purified E. canis was used as antigen. The results of the dot blot assay indicated that rP30 is a useful and sensitive 
tool for serodiagnosis of canine ehrlichiosis. 
CLAIMS 

What is claimed is: 

1. An isolated polynucleotide encoding an outer membrane protein of E. chaffeensis or an immunogenic 
fragment thereof; wherein the outer membrane protein is selected from the group consisting of OMP-1, OMP-1B, 
OMP-1C, OMP-1D, OMP-1E, OMP-1F, OMP-1R, OMP-1S, OMP-1T, OMP-1U, OMP-1V, OMP-1W, OMP-1X, 
OMP-1Y, and OMP-1Z. 

2. The isolated polynucleotide of claim 1 wherein the polynucleotide encodes an OMP-1 protein comprising a 
sequence which is at least 85% homologous to the amino acid sequence shown in Fig. 3B, SEQ. ID NO . 

3. The isolated polynucleotide of claim 1 wherein the polynucleotide encodes an OMP-1B protein comprising 
a sequence which is at least 85% homologous to the amino acid sequence shown in Fig. 4B, SEQ. ID NO . 

4. The isolated polynucleotide of claim 1 wherein the polynucleotide encodes an OMP-1C protein comprising 
a sequence which is at least 85% homologous to the amino acid sequence shown in Fig. 5B, SEQ. ID NO . 

5. The isolated polynucleotide of claim 1 wherein the polynucleotide encodes an OMP-1D protein comprising 
a sequence which is at least 85% homologous to the amino acid sequence shown in Fig. 6B, SEQ. ID NO . 

6. The isolated polynucleotide of claim 1 wherein the polynucleotide encodes an OMP-1E protein comprising 
a sequence which is at least 85% homologous to the amino acid sequence shown in Fig. 7B, SEQ. ID NO . 

7. The isolated polynucleotide of claim 1 wherein the polynucleotide encodes an OMP-1F protein comprising 
a sequence which is at least 85% homologous to the amino acid sequence shown in Fig. 8B, SEQ. ID NO . 

8. The isolated polynucleotide of claim 1 wherein the polynucleotide encodes an immunogenic fragment of 
the OMP-1 protein, said fragment comprising a sequence which is at least 85% homologous to the amino acid 
sequence extending from amino acid 6 through amino acid 251 as shown in Fig. 1, SEQ. ID NO . 

9. An isolated polynucleotide encoding an outer membrane protein of E. canis or an immunogenic fragment 
thereof; wherein the outer membrane protein is selected from the group consisting of P30, P30-A, P30-1, P30-2, 
P30-3, P30-4, P30-5, P30-6, P30-7, P30-8, P30-9, P30-10. 

10. The isolated polynucleotide of claim 9 wherein said P30 protein comprises a sequence which is at least 
85% homologous to the amino acid sequence shown in Fig. 19, SEQ ID NO . 

11. An isolated outer membrane protein of E. chaffeensis or an immunogenic fragment thereof, wherein said 
protein is selected from the group consisting of OMP-1, OMP-1B, OMP-1C, OMP-1D, OMP-1E, OMP-1F, 
OMP-1R, OMP-1S, OMP-1T, OMP-1U, OMP-1V, OMP-1W, OMP-1X, OMP-1Y, and OMP-1Z. 

12. The isolated OMP-1 protein of claim 11, wherein said protein comprises a sequence which is at least 85% 
homologous to the amino acid sequence shown in Fig. 4B, SEQ. ID NO . 

13. The isolated OMP-1B protein of claim 11 wherein said protein comprises a sequence which is at least 85% 
homologous to the amino acid sequence shown in Fig. 5B, SEQ. ID NO . 

14. The isolated OMP-1C protein of claim 11 wherein said protein comprises a sequence which is at least 85% 
homologous to the amino acid sequence shown in Fig. 6B, SEQ. ID NO . 
15. The isolated OMP-1D protein of claim 11 wherein said protein comprises a sequence which is at least 85% 
homologous to the amino acid sequence shown in Fig. 7B, SEQ. ID NO . 

16. The isolated OMP-1E protein of claim 11 wherein said protein comprises a sequence which is at least 85% 
homologous to the amino acid sequence shown in Fig. 8B, SEQ. ID NO . 

17. The isolated OMP-1F protein of claim 11 wherein said protein comprises a sequence which is at least 85% 
homologous to the amino acid sequence shown in Fig. 9B, SEQ. ID NO . 

18. The isolated immunogenic fragment of the OMP-1 protein of claim 11, said fragment comprising a 
sequence which is at least 85% homologous to the amino acid sequence extending from amino acid 6 through amino 
acid 251 as shown in Fig. 1, SEQ. ID NO . 

19. An isolated outer membrane protein of E. canis or an immunogenic fragment thereof; wherein the outer 
membrane protein is selected from the group consisting of P30, P30-A, P30-1, P30-2, P30-3, P30-4, P30-5, P30-6, 
P30-7, P30-8, P30-9, P30-10. 

20. The isolated P-30 protein of claim 19 wherein said protein comprises a sequence which is at least 85% 
homologous to the amino acid sequence shown in Fig. 19, SEQ ID NO . 

21. A method for diagnosing an infection with E. chaffeensis in a patient comprising the steps of: 

(a) providing a serum sample from the patient; 

(b) providing an outer membrane protein selected from the group consisting of a protein of claim 
11, a protein of claim 19, and mixtures thereof; 

(c) contacting the serum sample with the outer membrane protein; and 

(d) assaying for the formation of a complex between antibodies in the serum sample and 
the outer membrane protein, wherein formation of said complex is 
indicative of infection with E. chaffeensis. 

22. A method for diagnosing an infection with E. canis in a Canidae patient comprising the steps of: 

(a) providing a serum sample from the patient; 

(b) providing an outer membrane protein of claim 19; 

(c) contacting the serum sample with the outer membrane protein; and 

(d) assaying for the formation of a complex between antibodies in the serum sample and 
the outer membrane protein, wherein formation of said complex is 
indicative of infection with E. canis. 
10 20 30 40 50 60 

ATGRATTACA AAAAAGTTTT CATRACAAGT GCATTGATAT CATTAATATC TTCTCTACCT 
70 80 90 100 110 . 120 

GGAGTRXCAX TTTCCGACCC AGCAGGTAGT GGTATTAACG GTAATTTCTA C3VTCAGTGGA 
130 140 150 160 170 180 

AAATACATGC CflAGTGCTTC GCATTTTGGA GTATTCTCTG CTAAGGAAGA AAGARAIACA 

190 200 210 220 230 240 

ACAGTTGGAG TGTTTGGflCT GAAGCAAAAT TGGGACGGAA GCGCAATATC CAACTCCTCC 
250 260 270 280 290 300 

ccaaacGftTG tattcactgt ctcaaattat tcatttaaat aiga&aacaa ccccttttta 

310 320 330 . 340 . 350 360 

GGTTTTGCAG GAGCTATTGG TTACTCAATG GATGGTCCAA GAATAGAGCT TGAAGTATCT 

370 380 390 400 410 420 

TATGAAACAT "GATGTJVftA AftATCAAGGT AACAATTATA' AGAATGAAGC ACATAGATAT 

430 440 450 460 470 480 

TGTGCTCTAT CCCATAACTC AGCAGCAGAC ATGAGTAGTG CAAGTAATAA I Tt TGTCITT 

490 500 510 520 530 540 

CTAAAAAATG AAGGATTACT TGACATATCA TTTATGCTGA ACGCATGCTA TGACGTAGTA 

550 560 570 580 590 600 

GGCGAAGGCA TACCTTTTTC TCCTTATATA TGCGCAGGTA TCGGTACTGA TTTAGTATCC 

610 6 20 630 640 650 660 

ATGTTTGAAG CTACAAATCC TAAAATTTCT TACCAAGGAA AGTTAGGTTT AAGCTACTCT 

670 680 690 700 710 720 

ATAAGCCCAG AAGCTTCTGT GTTTATTGGT GGGCACTTTC ATAAGCTAAT AGGGAACGAA 

730 740 750 760 770 780 

TTTAGAGATA TTCCTACTAT AATACCTACT GGATCAACAC TTGCAGGAAA AGGAAACTAC 

790 800 810 820 830 840 

CCTGCAATAG XAATACTGGA TGTATGCOAC TTTGGAATAG ' AACTTGGAGG AAGGTTTGTA 

850 860 870 880 890 900 

TTCTAA 



Fig. 3A 

10 20 30 40 50 60 

MSXKKWITS A I.TS LI S S LP GVSFSDPAGS GUJGNFYISG KXMPSASHFG VFSAKEEPJJT 

70 80 90 100 110 120 

TVGVFGLKQH WDGSAISHS3 PNDVFTVSNY SFKXEHNPFL GFAGAIGYSH DGERIELBV3 

"° "0 150 160 170 180 

TfETFDVKHQG NHTKHEAHRY CALSHHSAAD HSSASHHFVF LKNEGLLDIS FMLHACYDW 

190 200 210 220 230 240 

GEGIPFSPYI CAGIGTDLV3 MFEATHPKIS YQGKLGLSYS ISPEASVFIG G HFHKVIGSE 

250 260 270 280 290 300 
FRDIPTIIPT GSTLAGKGHY PAIVTLDVCH FGIET-GGREV F 

Fig.3B 
10 20 30 40 50 60 

ATGAATTACA AGAAAATTTT TGTAAGCAGT GCAXTAATTT CATTAATGTC AAXCTTACCT 
70 80 90 100 110 120 

TACCAATCTT TTGCAGAXCC TGTAACTTCA AATGATACAG GAATCAACGA CAGCAGAGAA 
130 140 150 160 170 180 

GGCTTCTACA TTAGTCTAAA GTATAATCCA AGCATATCAC ACTTCAGAAA ATTCTCAGCT 
190 200 210 220 230 240 

GAAGAAGCTC CCATCAATGG AAMACTTCT ATCACTAAAA AGGTTTTCGG GCTGAAAAAA 

250 260 270 280 290 300 

GACGGAGATA TAGCACAATC TGCGAATTTT AACAGGACAG ATCCAGCCCT CGAGTTTCAG 
310 320 330 340 350 360 
AAIAACCTAA TATCAGGATT CTCAGGAAGT ATTGGTTATG CTATGGAXGG GCCAAGAATA 
370 380 390 400 410 420 
GAACTTGAAG CTGCATACCA AAAATTTGAT GCAAAAAAXC CTGACAACAA TGACACTAAT 
430 440 450 460 470 480 
AGCGGTGACT ACTATAAATA CTTTGGACTA TCTCGTGAAG ACGCAATAGC AGAIAAGAAA 
490 500 510 520 530 540 
TATGTTGTCC TTAAAAATGA AGGCATCACT TTTATGTCAT TAATGGTTAA CACTTGCTAT 
550 560 570 580 590 600 
GACATTACAG CTGAAGGACT ACCTTTCATA CCGTATGCAT GTGCAGGTGT AGGAGCAGAC 
610 620 630 640 650 660 
CTTATAAACG TATTTAAGGA TTTTAATTTA AAATTCTCAT ACCAAGGGAA AATAGGTATT 
670 680 690 700 710 720 
AGCTATCCAA TCACACCAGA AGTTTCCGCT TTTATTGGAG GATACTACCA CGGAGTTAIA 
730 740 750 760 770 780 
GGAAATAATT TTAACAAAAT ACCTGTAATA ACAGCTGTAG TATTAGAAGG AGCTCCTCAA 
790 800 810 820 830 840 
ACCACATCTG CGCTAGTAAC TATTGACACT GGATACTTTG GCGGAGAAGT TGGAGTAAGG 
850 860 870 880 890 900 
TTCACCTTCT AG 

Fig.4A 

10 20 30 40 50 60 

MNYKKXFVSS ALISLMSILP YQSFADFVTS NDTGH3DSRE GFYISVKXHP SISHFRKFSA 
70 80 90 100 110 120 

EEAPINGOTS ITKKVFGLKK DGDIAQSANF NRTDPALEFQ NHLI5GFSGS IGXAMDGPRI 

130 140 150 160 170 180 

ELBAAYQKFD AKHPDHNDTH SGDYYKYFGL SKEDAIADKK YWLKHEGIT EH3IMVHTCT 

190 200 210 220 230 240 . 

DIXAEGVPFI PYACAGVGAD LIHVFKDFNI. KFSYQGKIGX SYPITPEVSA FIGGYYHGVI 

250 260 270 280 290 300 
GBNFRKIFVX TPWLEG&PQ TTSALVTIOT GYFGGEVGVR FTF . • 

Fig.4B 
10 20 30 40 50 60 

ATGAACTGCA AAAAATTTTT TATAACAACT GCATTGGCAT TGCCAATGTC TTTCTTACCT 
70 80 90 100 110 120 

GGAATATTAC TTTCTGAACC AGTACAAGAT GACAGTGTGA GTGGCAATTT CTATATTAGT 

130 140 150 160 170 180 

GGCAAGTACA TGCCAAGTGC TTCTCATTTT GGAGTTTTCT CTGCCAAAGA AGAAAAAAAT 

190 200 210 220 230 240 

CCTACTGTCG CGTTGTATG6 TTTGAAACAA GATTGGAACG GTGl'TAGTGC TTCAAGTCAT 

250 260 270 280 290 300 

GCTGATGCGG ACTTTAATAA CAAAGGTTAT TCTTTTAAW ACGAAAACAA TCCATTTCTA 

310 320 330 340 350 360 

GG TT TTG CAG GAGCTATTGG TTATTCAATG GGTGGTCCAA GAMAGAGTT TGAAGTGTCC 

370 380 390 400 410 420 

TATGAAACAT TTGACGTGAA AAATCAAGGT GGTAATTACA AAAAXGATGC TCACAGATAC 

430 440 450 460 470 480 

TGTGCCTTAG AXCGTAAAGC AAGCAGCACT AATGCCACAG CTAGTCACTA CGTGCTACTA 

490 500 510 520 530 540 

AAAAATGAAG GACTACTTGA TATATCACTT ATGTTGAATG CATGCTATGA CGTAGTAAGT 

550 560 570 580 590 600 

GAAGGAATAC CTTTGTCXCC TTACATATGT GCAGGTGTTG GTACCGATTT AATATCCATG 

610 620 630 640 650 660 

TTTGAAGCTA TAAACCCTAA AATTTCTTAT CAAGGAAAGT TAGGTTTGAG TTACTCTATA 

670 680 690 700 710 720 

AACCCAGAAG CTTCTGTCTT TGTTGGTGGA CATTTTCATA AAGTTGCAGG TAATGAATTC 

730 740 750 760 770 780 

AGGGACATTT CTACTCTTAA AGCGTTTGCT ACACCATCAT CTGCAGCTAC TCCAGACTTA 

790 800 810 820 830 840 

GCAACAGTAA CACTGAGTGT GTGTCACTTT GGAGTAGAAC TTGGAGGAAG ATTTAACTTC 

850 860 870 880 890 900 
TAA 

Fig. 5A 

10 20 30 40 50 60 

MNCKKFFITT ALALPMSFLP GXLLSEFVQD DSVSGHFYIS GKYMPSASHF GVFSAKEEKN 
70 80 90 100 110 120 

PTVALYGLKQ DWNGVSASSH ADAD FMMK GY SFKYEKNPFL GFAGAIGTSM GGPRIEFEVS 

130 140 150 160 170 180 

YETFDVKKQG CTYKHDAHRT CALDRKASST NATA3HYVLL KHEGLLDISL MI^ftCTOWS 

190 200 210 220 230 240 

EGIPFSPYIC AGVGTDLISM FEAINPKXST QSKLGLSYSX NPEASVFVGG- HFHKVAGNEF 

250 260 270 280 290 300 
RD1STLKAFA TPSSAATPDL ATVTLSVCHF GVELGGRFHF 

Fig. 5B 
10 20 30 40 50 60 

ATGAACTGCG AAAAATTTTT TATAACAACT GCATTAACAT TACTAATGTC CTTCTTACCT 
70 80 90 100 110 120 

GGAATATCAC TTTCTGATCC AGTACAGGAT GAGAACATTA GTGGTAATTT CTACATCACT 

130 140 150 160 170 180 

GGAAAGTRIA TGCCftAGCGC TTCGCftTTTT GGAGTTTTTT CTGCCAAGGA AGAAAGAAAT 

190 200 210 220 230 240 

ACAACAGTTG GAGTATTTGG AATAGAGCAA GATTGGGATA GATGTGTAAT ATCTAGAACC 

250 260 270 280 290 300 

ACTTTAAGCG ATATATTCAC CGTTCCAAAT TATTCATTTA AGTATGAAAA TAATCTATTT 

310 320 330 340 350 360 

TCAGGATTTG CAGGAGCTAT TGGCTACTCA ATGGATGGCC CAAGAATAGA GCTTGAAGTA 

370 380 390 400 410 420 

TCTTATGAAG CATTCGATGT TAAAAATCAA GGTAACAATT ATAAGAACGA AGCACATAGA 

430 440 450 460 470 480 

TATTATGCTC TGTCCCATCT TCTCGGCACA GAGACACAGA TAGATGGTGC AGGCAGTGCG 

490 500 510 520 530 540 

TCTGTCTTTC TAATAAATGA AGGACTACTT GATAAATCAT TTATGCTGAA CGCATGTTAT 

550 560 570 580 590 600 

GATGTAATAA GTGAAGGCAT ACCTTTTtCT CCTTATATAT GTGCAGGTAT TGGTATTGAT 

610 620 630 640 650 660 

TTAGTATCCA TGTTTGAAGC TATAAATCCT AAAATTTCTT ATCAAGGAAA ATTAGGCTTA 

670 680 690 700 710 720 

AGTTACCCTA TAAGCCCAGA AGCTTCTGTG TTTATTGGTG GACATTTTCA TAAGGTGATA 

730 740 750 760 770 780 

GGAAACGAAT TTAGAGATAT TCCTACTATG ATACCTAGTG AATCAGCGCT TGCAGGAAAA 

790 800 810 820 830 840 

GGAAACTACC CTGCAATAGT AACACTGGAC GTGTTCTACT ' TTGGCATAGA ACTTGGAGGA 

850 860 870 880 890 900 

AGGTTTAACT TCCAACTTTG A 

Fig.6A 

10 20 30 40 50 60 

MNCEKFFTTT ALTLIMSEXP GISLSDPVQD DNISGNFYIS GKYMPSASHF GVFSAKEERN 
70 80 90 100 110 120 

TTVGVFGIEQ DWDRCVISHT TXSDIFTVPN YSFKYENHLF SGFAGAIGYS MDGPRIELEV 

130 140 150 160 170 180 

SYEAFDVKNQ GNNYKNEAHR YXALSHLLGT ETQIDGAGSA SVFLUJEGLL DKSFMLHAOf 

190 200 210 220 230 240 

DVXSEGIPFS PYICAGIGID LVSMFBAXNP KI3YQGKLGL 3YPISPEASV FTGGHFHKVI 

250 260 270 280 290 300 
GHEFRDIPTM IPSESALAGK GNYPAIVTLD VFYFGTELGG RFHFQL . 

Fig.6B 
SUBSTITUTE SHEET (RULE 26) 



WO 99/13720 PCT/US98/19600 



10 20 30 40 50 60 

ATGAATTGCA AAAAAITTTT TAIAACAACT GCATTAGTAT CACIAATGTC CTTTCTACCT 
70 80 90 100 110 120 

GGAAIATCRT TTTCTGATCC AGTGCAAGGT GACAATATTA GTGGTAATTT CTATGTTAGT 

130 140 150 160 170 180 

GGCAAGTRXA TGCCAAGTGC TTCGCATTTT GGCATGTTTT CTGCCAAAGA AGAAAAAAAX 

190 200 210 220 230 240 

CCTACTGTTG CATTGTATGG CTTAAAACAA GATTGGGAAG GGATTAGCTC ATCAAGTCAC 

250 260 270 280 290 300 

AATGAIAATC ATTTCAATAA CAAGGGTTAT TCATTTAAAT ATGAAAATAA CCCATTTTTA 

310 3 20 330 340 350 360 

GGGTTTGCAG GAGCTATTGG TTATTCAATG GGTGGTCCAA GAGTAGAGTT TGAAGTGTCC 

370 3 80 390 400 410 420 

TATGAAACAT TTGACGTTAA AAATCAGGGT AATAACTATA AAAATGATGC TCACAGATAC 

430 440 450 460 47.0 480 

TGTGCTTTAG GTCAACAAGA CAACAGCGGA ATACCTAAAA CTAGTAAATA CGTACTGTTA 

490 500 510 520 530 540 

AAAAGCGAAG GATTGCTTGA CATATCATTT ATGCTAAATG CATGCTATGA TATAATAAAC 

550 560 570 580 590 600 

GAGAGCATAC CTTTGTCTCC TTACATATGT GCAGGTGTTG GTACTGATTT AATATCCATG 

610 620 630 640 650 660 

TTTGAAGCTA CAAATCCTAA AATTTCTTAC CAAGGGAAGT TAGGTCTAAG TTACTCTATA 

670 _ 680 690 700 710 720 

AACCCAGAAG CTTCTGTATT TATTGGTGGA CATTTTCATA AGGTGATAGG AAACGAAXTT 

730 740 750 760 770 780 

AGGGACATTC CTACTCTGAA AGCATTTGTT ACGTCATCAG CTACTCCAGA TCTAGCAATA 

790 800 810 820 830 840 

GTAACACTAA GTGTATGTCA TTTTGGAATA GAACTTGGAG GAAGGTTTAA CTTCTAA. . . 

Fig. 7A 



10 20 30 40 50 60 

MNCKKTFITT ALVSLMSFLP GISFSDPVQG DNISGOTYVS GKYMPSASHF GMFSAKEEKN 
70 80 90 100 110- 120 

PTVALYGLKQ DWEGISSSSH NDNHFHNKGY SFKYENHPFL GFAGAIGYSM GGPRVEFBV3 

130 140 150 160 170 180 

XETE DVKHQG NHYKHDAHIUf CALGQQDHSG IPKTSKXVLL KSBGtLDISE HLHACYDIIH 

190 200 210 220 230 240 

ESIPLSPYIC AGVGTDLISM FEATNPKISY QGKLGLSY3I HPEASVFIGG PFHKVIGHKF 

250 260 270 280 290 300 
RDIPTLKAFV TSSATPDLAI VTLSVCHFGX ELGGRFHF J 

Fig.7B 
10 20 30 40 50 60 

ATGAATTGCA AAAAATTTTT TATAACflACT ACATTAGTAT CGCTAATGTC CTTCTTACCT 
70 80 90 100 110 120 

GGAATATCAT TTTCTGATGC AGTACAGAAC GACAATGTTG GTGGTAATTT CTATATCAGT 

"0 l 4 o 150 160 170 180 

GGGAAAIRTG TACCAAGTGT TTCACAITTT GGCCTnTTCT CTGCTARACA GGAAAGAAAT 

190 200 210 220 230 240 

ACAACaacCG GAGTATTTGG ATTAAAGCAA GATTGGGATG GCAGCACAAT ATCTAAAAAX 

250 260 270 280 290 300 

TCTCCAGAAA ATACATTTAA CGTTCCAAAT TATTCATTTA AATATGAAAA TAATCCATTT 
310 320 330 340 350 360 

CTAGGTTTTG CAGGAGCTGT TGGTTATTTA ATGAATGGTC CAAGAAIAGA GTTAGAAATG 

370 380 390 400 410 420 

TCCTATGAAA CATTTGATCT GAAAAACCAG GGTAATAACT ATAAGAACGA TGCTCACAAA 

430 440 450 460 470 480 

TATTATGCTT TAACCCAIAA CAGTGGGGGA AAGCTAAGCA ATGCAGGTGA TAAGTTTGTT 

490 500 510 520 530 540 

TTTCTAAAAA ATGRAGGACT ACTTGATATA TCACTTATGT TGAATGCATG CTATGATGTA 

550 560 570 580 590 600 

ATAAGTGAAG GAATACCTTT CTCTCCTTAC ATATGTGCAG GTGTTGGTAC TGATTTAATA 
610 620 630 640 650 660 

TCCATGTTTG AAGCTATAAA CCCTAAAATT TCTTATCAAG GAAAGTTAGG TTTGAGTTAC 

670 680 690 700 710 720 

TCCATAAGCC CAGAAGCTTC TGTTTTTGTT GGTGGACATT TTCAIAAGGT GATAGGGAAT 

730 740 750 760 770 780 

GAATTCAGAG ATATTCCTGC TATGATACCC AGTACCTCAA CTCTCACAGG TAATCACTTT 
790 800 810 820 830 840 

ACTATAGTAA CACTAAGTGT ATGCCACTTT GGAGTGGAAC TTGGAGGAAG GTTTAACTTT 
850 860 870 880 890 900 

Fig.8A 



10 20 30 40 50 60 

MNCKKFETTT TLVSLMSFLP GISFSDAVQN DNVGGNFYIS GKYVPSVSHF GVFSAKQERN 
70 80 90 100 . 110 120 

TTTGVFGLKQ DWDGSTISKN SPEHTFNVPH YSFKYENHPF LGFAGAVGYL MNGPRIELEM 

130 140 150 160 170 180 ' 

STCETFDVKHQ GNHYKHDAHK YXALTHHSGG KLSNAGDKFV FLKNEGLLDI SLMLNACYDV 

190 200 210 220 230" 240 

ISEGIPF3PY ICAGVGTDLI SMFEAXHPKT SYQGKLGLSY SI3PEASVFV GGHFHKVTGH 

250 260 270 280 290 300 
EFRDIPAMIP STSTLTGHHP TIV T lSV C H F GVELGGRFHF 

Fig. 8B 
10 20 30 40 50 60 

ATGGAAAATC TCATGAATAA GAAAAACAAA TTCTTTACAA TAAGTACAGC AATGGTATGC 
70 80 90 100 110 120 

TTATTGTTAT TACCTGGTAT ATCATTTTCA GAAACTAIAA ACAACAGTGC TAAAAAACAG 

130 140 150 160 170 180 

CCTGGGTTAT ATATCAGTGG GCAGTACAAA CCTAGTGTTT CAGTTTTTAG TAATTTTTCA 

190 200 210 220 230 240 

GTAAAAGAAA CTAATGTTCC CACAAAGCAG TTAATAGCAC TTAAAAAAGA CAXTAATTCT 

250 260 270 280 290 300 

GTTGCAGTTG GTAGTAATGC TACTACAGGT ATTAGCAATC CAGGTAATTT CACAATTCCT 

310 320 330 340 350 360 

TATACTGCAG AATTTCAAGA TAATGTTGCC AATTTCAATG GGGCTGTTGG TTACTCTTTT 

370 380 390 400 410 420 

CCTGATAGTC TAAGAATTGA AATAGAGGGA TTTCATGAAA AATTTGATGT CAAAAACCCT 

430 440 450 460 470 480 

GGAGGTTACA CACAAGTAAA AGATGCGTAC CGTTATTTTG CACTAGCACG TGATTTAAAA 

490 500 510 520 530 540 

GATGGCTTCT TTGAACCTAA AGCGGAAGAT ACAGGTGTTT ATCATACTGT TATGAAAAAT 

550 560 570 580 590 600 

GATGGATTAT CTATTTTATC TACTATGGTT AACGTCTGTT ACGATTTTTC TGTAGATGAA 

610 620 630 640 650 . 660 

TTACCAGTCT TACCTTAXAT ATGTGCAGGT ATGGGTAXAA ACGCCATAGA ATTCTTCGAC 

670 680 690 700 710 720 

G CTTT ACATG TAAAATTTGC TTACCAAGGC AAACTAGGTA TTAGCTATCA ACTATTTACT 

730 740 750 760 770 780 

AAAGTAAATT TATTCCTTGA TGGGTATTAC CATCAAGTAA TAGGCAATCA ATTCAAAAAC 

790 800 810 820 830 840 

TTAAACGTAA ACCATGTTTA CACACTTAAA GAATCTCCTA AAGTCACATC TGCAGTAGCT 

850 860 870 880 890 900 
ACACTTGACA TTGCATACTT TGGTGGCGAA GTTGGAATAA GATTCACATT TTAA 

Fig. 9A 



10 20 30 40 50 60 

MV CTJJ.LP GI SFSETIHNSA KKQPGLYISG QYKPSVSVFS HFSVKETHVP TKQLIALKKD 
70 80 90 100 110 120 

•INSVAVGSNA TTGISHPGHF TIFYTAEFQD NVAHFNGAVG Y5FPDSLRIE IEGFHKKFDV 
130 140 150 160 170 180 

KHPGGYTQVK DAYRYEALAR DLKDGFFEPK AEDTGVYHTV HKHDGLS1XS TMVHVCTDF3 
190 200 210 220 230 240 

VDELFVLFYI GAGHGU2AIE FFDALHVKFA YQGKLGISYQ LFTKVHLFLD GYYHQVTGBQ. 

250 260 270 280 290 300 
FKHLNVHHVY TLKESPKVTS AVAXLDIAYF GGE VGIR FTF * 

Fig.9B 
10 20 30 40 50 60 

AXGATATATA AAGAAAAACT TACTAGAGTG GGAGAATATA TCTTAGCATA TTTATCATTT 

70 80 90 100 110 120 

ATTCTTTCTA CTTATATCTT TCTAGTGCTG GTAAATATTA TTAGATATAA CAGCCTTGCT 
130 140 150 160 170 180 

ATAXGTGTTA TCAGTCIACT AAGAACTAAT ATCTTTAACG TTAGCACAAA AAAATTAATA 
190 200 210 220 230 240 

AAACaTAAAT GTCGTGATAC TAAGTTTAGT AACATGAATT GTTATTTGTA CGGTAAACCG 

250 260 270 280 290 300 

TTAAATTTAC AAATTTTTTA TGGAATATTT TCCTTTATTA GAAACTTTCA AAA7AACACA 

310 320 330 340 350 360 

CXAAXAATTC CTAATGATAG TAAATGCGGC TTCTATACCA CGTTATGGGA TAATCCAGCA 

370 380 390 400 410 420 

CTACATTAXA CATATACACT TACTGGCAGT GAGTACCGTA ATTTTTTTGA CATTCTATAT 
430 440 450 460 470 480 

GAAAACATTA TCTGTCAATG TAAATTACTT ATTAACTATA ACCGTTCTGT ATTAAACCAA 

490 500 510 520 530 540 

CATAATAAAA ATACTCTCGT AATAATACCA ATACCTAATG CTAGAGAGTT CAGTAATGAA 

550 560 570 580 590 600 
ATTCGAGTAA GGAATATATC AATAAATAAG GAAAGTTCTT ATGAGTGCTA A 

Fig. 10A 



10 20 30 40 50 60 
MIYKEKLTRV GEYILATLSF ILSTYIFLVL VNIIRYHSLA ICVXSIJJITN IFNVSTKKLI 

70 80 90 100 110 120 

KDKCRDTKFS NMNCYLYGKP LNLQIFYGIF SFIRNFQHNT LIIPNDSKCG ETCTTLHDNPA 

130 140 150 160 170 180 

LHYTYTLTGS EYRHFFDH.Y EHUCQCKLL IHYHRSVLNQ HNKHTLVHP IPNAREFSHE 

190 200 210 220 230 240 

IKVRNISIBK ESSYEC. ... .......... .......... 

Fig. 10B 
10 20 30 40 50 60 

ATGAATAAAA AAAACAAGTT TATTATAGCT ACAGCATTGG TATATTTACT GTCATTACCT 

70 80 90 100 110 120 

AGTGTATCGT TTTCAGAGGT TACAAACAGC AGTATTAAAA AACACTCTGG GTTATATATT 

130 140 150 160 170 180 

AGTGGACAAT ACAAACCAAG TGTTTCTGTT TTTAGTaGTT TCTCAATTAA AGAAACTAAC 

190 200 210 220 230 240 

ACTBXCftCAA AAAATCTTAT agcgttaaaa aaagatatta actctcttga agttaacgcc 

250 260 270 280 290 300 

GATGCTAGTC AAGGTATTAG TCATCCAGGA AATTTTACTA TACCTTATAT AGCAGCATTT 

310 3 20 330 340 350 360 

GAAGATAATG CTTTTAATTT CAACGCTGCT ATTGGTTACA TTACTGAAGG TCTAAGGATT 

370 380 390 400 410 420 

GAARTAGAAG GTTCCTATGA AGAATTTGAT GCTGAAAACC CTGGAGGTTA TGGTCTAAAT 

430 440 450 460 470 480 

GATGCCTTTC GGTACTTTGC TTTAGCACGT GATATGGAAA GCAACAAGTT CCTACCAAAA 

490 500 510 520 530 540 

GCACAAAGCT CAC 



Fig. 11A 



10 20 30 40 50 60 

MNKKNKFIIA TAIATCLLSLP SVSFSEVTNS SIKKHSGLYI SGQYKPSVSV FSSFSIKETN 
70 . . 80 90 100 110 120 
TITKNLIALK KDINSLEVNA DASQGISHPG NFTIPYIAAF EDNAFNFHGA IGYITEGLRI 
130 140 150 160 170 180 
EXEGSYEEFD AENPGGYGLN DAFRYFALAR BMBSHKELPK AQSS 



Fig. 11B 



10 20 30 40 50 60 

TCTAGAAXAC ATGATGAAAA TTATGCTATT ACAACAAATA ATAAATTATC CATCGCATCT 
70 80 90 100 110 120 

ATTATGGTXA ACACCTGCTA TGATATTTCA AXTAAEAAIA CATCAAIAGT ACCGTATTTA 

130 140 150 160 170 180 

TGGACAGGCA TTGGTGAAGA TCTTGTAGGG CTTTTTA AXA CAATACATTT TAAACTTGCA 

190 200 210 220 230 24 0 

TATCAAGGGA AAGTTGGAAT GAGTTATTTG AXAAAXAACA AXATCCTATT ATTTTCTGAC 

250 260 270 280 290 300 

ATATATTATC ATAAAGTCAX GGGTAACAGA TTTAAAAATT TGTACATGCA ATATGTAGCT 

310 320 330 340 350 360 

GATCCTAATA TTTCTGAAGA AACTATACCT AXATTAGCAA AACTTGATAT ?GGTTATTTT 

370 380 390 400 410 420 
GGAAGTGAAA TTGGAATAAG GTTTATGTTT AACTAA 

Fig. 12A 



10 20 30 40 50 60 

SRIHDE1JYAI TTNNKLSIAS . IMVNTCYDIS HWTSIVPYI. CTGIGEDLVG ItESTXHFKLA 

70 80 * 90 100 . 110 120 
YQGKVGMSYL INNNXLLFSD IYYHKVMGNR FKNLYMQYVA DPNISEEMP ILAKLDIGYF 
130 140 150 160 170 180 
GSEIGXRFMF N 

Fig. 12B 
10 20 30 40 50 60 
ATGACAAAGA AATTTAATTT TGTAAATGTT AIATTAACAT TTTTGTTATT TCTTTTCCCA 
70 80 90 100 110 120 
CTTAAGTCAT TTACAACAIA TGCAAATAAT AACACAATCA CTCAAAAAGT TGGATTGTAC 
130 140 150 160 170 . 180 
AXAAGTGGTC AAIATAAGCC AAGIAITCCX CATTTCAAGA A3TTTTCAGT AGAAGAAAAT 
190 200 210 220 230 240 
GRCAAAGTAG TAGATTTGAT AGGTCTTACA ACTGATGTTA CAIATATCAC AGAACATATA 
250 260 270 280 290 300 
TTACGAGATA ATACAAAATT CAACACTCAT TATATTGCAA AGTTCAAGAA CAATXTTAIA 
310 320 330 340 350 360 
AATTTCAGCA GTGCAATTGG TTATTATTCT GGGCAAGGAC CAAGGTTAGA AATAGAAAGC 
370 380 390 400 410 420 
TCTTATGGGG ATTTTGAIGT TGTAAATTAT AAAAATTATG CAGTACAAGA TGTTAATAGA 
430 440 450 460 470 480 
TATTTTGCTT TAGTACGTGA AAAAAATGGT TCAAATTTCT CTCCAAAACC ACATGAAACT 
490 500 510 520 530 540 
AGTCAACCCT CTGACAGTAA TCCTAAAAAG TCTTTTTATA CTTTAATGAA 6AATAATGGG 
550 560 570 580 590 600 
GTATTTGTTG CATCAGTAAT AATCAACGGT TGTTATGATT TTTCTTTTAA TAACACAACA 
610 620 630 640 650 660 
ATATCACCTT ACGTATGTAT AGGAGTTGGA GGAGATTTTA TAGAGTTTTT TGAAGTAATG 
670 680 690 700 710 720 
CATATCAAGT TTGCTTGCCA AAGTAAGGTT GGTATTAGCT ATCCAAIATC TCCCTCTATT 
730 740 750 760 770 780 
ACTATTTTTG CTGATGCACA TTATCACAAG GTCATAAATA ATAAAXTTAA CAACCTACAT 
790 800 810 820. 830 840 
GTTAAGTATT CATATGAACT TAAAAACTCA CCTACCATTA CCTCTGCAAC AGCCAAACTA 
850 860 870 880 890 900 
AACATTGAAT ATTTTGGTGG TGAAGTTGGG ATGAGATTTA TATTTTAA 

Fig. 13A 

10 20 30 40 50 60 

MXKKFHEVHV ILTFLLFLFP LKSFTTYANN NTITQKVGLY ISGQYKPSIP HFKNFSVEEH 
70- 80 90 100 110 120 

DKWDLIGLT TDVTOTEHI LRDNTKFHTH YIAKFKHHET NFSSAIGTZS GQGPKLEIES 

130 140 . 150 160 170 180 

SYGDFDWHY KHYAVQDVNR YFALVREKHG SHFSPKPHET SQPSDSHPKK SFTCU1KHBG 

190 200 .210 220. '230 240 

VFVASVIING CTOFSFHBTT ISPxVCIGVG GDFIEFFEVM HIKFACQSKV GISYPISPSI 

250 260 270 . 280 290 300 
TIEADAHYHK VHIHKFHKLH VKYSYELKNS* PTITSATAKL NIEYFGGEVG MHFIF 

Fig. 13B 
10 20 30 40 50 60 

ATGAGCAAAA AAAAGTTTAT TACAATAGGA ACAGTACTTG CRTCTCTATT ATCATTCTTA 
70 80 90 100 110 120 

TCTATTGAAT CCTTTTCAGC TATAAATCAT AATCAIACAG GAAATAACAC TAGTGGTATA 
130 140 150 160 170 180 

TMOTUaG GGOIGIAIAG ACCAGGAGTA TCCCATTTTA GCAATTTCTC AGTAAAAGAA 

190 200 210 220 230 240 

ACTAATGTTG AXACAAIACA ACXW3TAGGA TAIAAAAAAA GTGCGTCTTC TATCGATCCT 
250 260 270 280 290 300 

AACACTTATT CAAACTTTCA AGGTCCATAT ACTGTTACAT TTCAAGATAA TGCTGCTAGT 
310 320 330 340 350 360 

TTCAGTGGAG CAAITGGATA TTCTTACCCC GAAAGTCTAA GACTTGAACT TGAAGGTTCT 

370 380 390 400 410 420 

TACGAAAAAT TTGATGTCAA AGATCCTAAA GACTACTCAG CAAAAGAIGC TTTTAGGTTT 
430 440 450 460 470 480 

TTTGCTCTAG CACGTAATAC GTCTACTACT GTTCCTGATG CTCAAAAAIA TACAGTTAXG 
490 500 510 520 530 540 

AAGAATAATG GCTTATCTGT TGCATCAAXC ATGATCAATG GTTGTTATGA TCTATCXTTT 
550 560 570 580 590 600 

AATAATTTAG TCGTATCACC TTATATATGT GCAGGTATTG GTGAAGATTT CATTGAATTT 
610 620 630 640 650 660 

TTTGATACTT TGCACATTAA ACTTGCTTAT CAAGGAAAAC TAGGTATCAG TTATTACTTC 
670 680 690 700 710 720 

TTTCCTAAGA TTAATGTATT TGCTGGTGGG TACTATCATA GAGTTAXAGG GAATAAATTT 
730 740 750 760 770 780 

AAAAATTTAA ATGTTAACCA TGTTGTTACA CTTGATGAAT TTCCTAAAGC AACTTCTGCA 
790 800 810 820. 830 840 

GTAGCTAGAC TTAATGTTGC TTATTTTGGT GGTGAAGCTG GAGTAAAGTT TACATTTTAA 

850 860 870 880 890 900 


Fig. 14A 

10 20 30 40 50 60 

MSKKKFITIG TVLASLLSEX SIESFSAIHH NHTGHBTSGI YTTGQYRPGV SHFSHFSVKE 
70 80 90 100 110" 120 

TNVDTIQLVG YKKSASSIDP HTYSHFQGFY TVTFQDHAAS FSGAIGYSYP ESLBLBLEGS 
130 140 150 160 170 180 

YEKFDVKDPK DYSAKDAFRF EALABOTSTT VPDAQKYTVM KHNGLSVAS1 HTHGCYDLSF 
190 200 210 220 230 240 

NHLW3PYIC AG1GBDFIEF FDTIBIKLAY QGKLGISYYF FPKIMVFAGG YYHRVXGHKF 
. 250 . 260 270 280 290. 300 

KHLNVNHWT LDEFPKATSA VATLNVAYFG GEAGVKFTF. .......... .......... 

Fig. 14B 
10 20 30 40 50 60 

ATG>JGTGCTA aaaaaaagct ttttmaata gggtcagtct tagtatgttt agtgtcatac 

70 80 90 100 110 120 

TTACCTACTA AATCTTTGTC AAACTTRAAT AATATTAAIA AIAACACTAA GTGCACTGGG 
130 140 150 160 170 180 

CTATATGTCA GTGGACAATA TAAACCTACT GTTTCTCACT TTAGTAATTT TTCACTTAAA 
190 200 210 220 230 240 

GAAACTTATA CTGACACTAA AGAGTTA3CTA GGACTAGCAA AAGATATTAA GTCTAXTACA 
250 260 270 280 290 300 

GATATAACAA CAAATAAAAA ATTCAACATT CCTTATAACA CAAAATTTCA' AGAIAATGCT 

310 320 330 . 340 350 360 

GTTAGCTTCA GTGCAGCTGT TGGATAIATT TCCCAAGACA GTCCAAGGGT TGAGGTAGAA 

370 380 390 400 410 420 

TGGTCTTATG AAGAATTTGA CGTTAAAAAT CCTGGTAATT ACGTAGTAAG TGAAGCCTTC 

430 440 450 460 470 480 

AGGTATATTG CTTTAGCAAG AGGAATTGAT AATCTTCAAA AAIATCCTGA AACAAATAAG 

490 500 510 520 530 540 

TATGTTGTTA TAAAGAACAA TGGCTTATCT GTCGCATCCA TTATAATCAA TGGCTGTTAT 

550 560 570 580 590 600 

G ATTTTTCTT TAAACAATTT AAAAGTAXCA CCTTACATAT GCGTAGGGTT TGGTGGGGAC 

610 620 630 640 650 660 

AXTATAGAAT TTTTTAGTGC TGTAAGTTTT AAATTTGCTT ATCAAGGTAA GGTAGGTATC 

670 680 690 700 710 720 

AGTTATCCAT TAXTCTCTAA TATGATTATA TTTGCTGACG GATATTACCA TAAGGTCATA 

730 740 750 760 770 780 

GGAAATAAAT TTAACAATTT AAATGTTCAA CACGTTGTTA GTCTTAACAG TCATCCTAAG 

790 800 810 820 830 840 

TCTACTTTTG GAGTAGCTAC TCTTAAXGTT GAGTATTTCG GTAGTGAATT TGGGTTAAAA 

850 860 870 880 890 900 
TTTATATTTT AA 

Fig. 15A 

10 20 30 40 50 60 

MSJVKKKLFII GSVLVCLVSY LPTKSLSNLN OTHNHTKCTG LYVSGQYKPT VSHFSHFSLK- 
70 80 90 100 110 120 

ETTTDTKBtL GtAKDIKSIT DITTHKKFHI FYNTKFQDNA VSFSAAVGYI SQDSPKVEVE 

130 140 150 160 170 180 

WSXEEEDVKH PGHYWSEAF RYIALARGID NLQKYFETHK XVVUUWGLS VA3IIIHGCX 
190 200 210 220 230 240 
DFSINHLKVS PYICVGFGGD IXBFFSAVSP KFKfQGKVGI SYPLFSNMII EWfaxxiuvvl 
250 260 270 280 290 300 
GHKFNHLHVQ HWS1HSHPK STFAVftTLHV EYFGSEEGLK FIT 

Fig. 15B 
10 20 30 40 50 60 

ATGAGTAAAA AAAATTTTAT TACAATAGGA GCAACACTTA TTCAIATGTT GTTACCTAAC 
70 80 90 100 110 120 

AUKTcnrrc cagraactkp taacaataac actgaiaaac tttctgggtt aiaiataagt 

130 140 150 160 170 180 

GGGCAftlRIA AACCAGGGAX TTCTCATTTC AGCAAATTTT CAGTCAAAGA AATCTAIAAT 

190 200 210 220 230 240 

GAIAACATTC AACTAATTGG GTTAAG&CRC AACGCAAITT CIACTACTRC CCTTAftlBIT 

250 260 270 280 290 300 

AATACAGATT TCAAXA1CCC CTAIAAAGTA ACAXTTCAAA AIAACATTAC CAGCTTTAGT 

310 320 330 340 350 360 

GGRGCIAITG GTTAXTCTGA TCCCACAGGG GCAAGATTTG AGCTTGAAGG TTCIXKTGftft 

370 380 390 400 410 420 

GAATTTGATG TGACAGRTCC TGGAGACTGC TTAATAAAAG ATACCTATAG ATATTTCGCT 

430 440 450 460 470 480 

TTAGCTAGAA ACCCATCAGG TTCTAGCCCT ACCTCAAACA ACTATACTGT TATGAGAAAT 

490 500 510 520 530 540 

GATGGTGTTT CCATTACTTC TGTTATATTT AATGGCTGTT ATGACATCTT TTTAAAGGAT 

550 560 570 580 590 600 

TTAGAAGTAT CACCTTATGT ATGTGTTGGT GTAGGTGGAG ATTTTATAGA ATTTTTTGAC 

610 620 630 640 650 660 

GCATTACACA TTAAATTAGC ATACCAAGGC AAGTTAGGTA TCAATTATCA CTTATCGACT 

670 680 690 700 710 720 

CAAGCAAGCG TATTTATTGA TGGATATTAT CAXAAGGTTA TAGGAAATCA ATTCAACAAT 

730 740 750 760 770 780 
CTAAATGTTC AACACGTGGC TAGTACAGAT TTTGGACCTG TATACGCAGT AGCCACACTT 
790 800 810 820. 830 840 
AACATTGGTT ATTTTGGTGG TGAAATCGGA ATTAGACTTA CATTTTAA 

Fig. 16A 

10 20 30 .40 50 60 

MSKKNFITIG AXI.IHMLLPN 1SFPET1HHM TDKLSGMEXS GQYKPGISHF SKFSVKEIYH 
70 80 90 100 110 120 

DHIQLIGLRH HAISTSTLHI HTDFNIPYKV TFQNNITSF3 GAIGYSDPTG ARFELEGWE 

130 140 150 160 170 180 

EFDVTDFGDC LIKDTXKYFA 1ABHFSGSSB TSHHYTVMRH DGVSITSVIF NGCYDIFLKD 

190 200 210 220 230 240 

t^VSPYVCVG VGGDFIBFFD ALHUCLAYQG KLGHKHLST QASVFIDGYY HK7IGNQFHH 

250 260 270 280 290 300 
LNVQHVASTD FGPVYAVATL NIGYFGGEIG IRUF .' 

Fig. 16B 
10 20 30 40 50 60 



ATGAAXAAXA GAAAAAGTTT TTtWa GGTGCATCaJ TACTAGCAA^ CTTAITATTC 
/v 80 on 

110 120 
TTAA ACCTAGGTTA 
170 . 180 
rTTC AGTCAAAGAA 

^^^^C^^ ATACTACTCA ACTAGTTGGG CTTAAAAAGG AGAXCAGTGT CATAGGGAAC 



ACATCTGAG^ CCTCTTCIAC AGGAAAXGTA AGTAACCAIA CTTATTTTAA ACCTAGGTTA 
140 150 160 170 ion 

'AIAJCABTG GACAAXA2AG ACCAGGAGTT TCTCATTTTA' GCAAATTiTC AGTCAAAGAA 



2tt "0 i«> 290 300 

RATT TCAAGACAAT 
350 360 
HAG AATTGAAGTA 

^AGGCTTCTT ATGAAGAATT TGATGTTAAA AATCCAGAAG GATCTGCTAC AGACGCATAC 



ACTAAXATCA CAACCTACAC AAATTTraar m ltl| 290 . 300 

310 ^ TTTCCTTACA TTGCAGAATT TCAAGACAAT 

AU 320 330 340 

GGGC AATTGGATAC TTGTATTCCG 
380 390 400 



^^"^^tO AATTCGAXAC TTGTATTCCG AGAATTTTAG AATTGAAGTA 



—»™ CACTAGCACC TGCTATGGAT GGCACTA»£ AATCTAGtS TGATGAC^ 

«™S 'o«JS 

rtmC t B,Qn S TCa ™ «—S AGGAAXAGGA 

" ggagatttca tagagttttt taatgattta catgttaagt ttgctcatca aggcaaggta 

GGTATTAGTT ATTCIAIATC CCCTGAAGTA AGTTTATTTC TTAACGGATA TTACCATAAA 

CCTAAGTTCA CATCTGCAGT TGCTACArrr- BAwW.f.- 830 840 

850 MO OTACA ^ AATGTTGGGT ACTTTGGTGG CGAAATTGGA 

GIAA6ATTTA TATTTTAA . 88 ° 890 900 

Fig. 17A 



10 20 30 40 50 

„ — ™.«..„BL XISGQTOPGV 

80 - 90 inn 

-™J «msn« wiBo m-. Mtt-S'atahS 



• 20 30 4o 

-HHHKSFm GASLLASLLF TSEASSTGNV SHHTYFKPRL VISCQ^ SHFSKFSVKB 



140 . 150 ieo no 180 

WBDG LSISSVMING 

CWTLODXP VVP^CAGXO GDFTBFCNI^L HVKEAHQGKV GISVSISP* SLBX^ 
VTGUREKHLH VQHVSDLSDA PKFTSAVML . NVGYFGGEIG VRFIT. ™ ....... ^ 

Fig. 17B 

10 20 30 40 50 60 

TAGCAGCACT AAAAAACAGT TTGGGTTAIA TGTTAGTGGA CAACACCAGC CTAGTGTTTC 
70 80 90 100 110 120 

TATTTTTAGC AAXTTCTCAG TAAAGGAAAC TAATTTTCCT ACAAAGTATT CTAGCAGCTT 

130 140 150 160 170 . 180 

CTTAAAAAAA GACATTAATT CTGTCGAATP TGACGAIAGT GTTACTGCTG GCATTAGTTA 

190 200 210 220 230 . 240 

CCCACTTAAT TTCAGTACTC CTTATATAGC TGTATTTCAA GAXAAXKRT CTAATTTTAA 

250 260 270 280 290 300 

TGGCGCTAXT GGGTACACTT TT G T TGA AGG CCCAAGAATT GAAATAGAAG GTTCTTATGA 

310 320 330 . 340 350 360 

AGAATTCGAT GTCAAAGACC CTGGAAGATA TACAGAAATA CAAGATGCAT ACCGTTACTT 

370 380 390 400 410 420 

TGCTTTAGCA CGTGATATAG ACTCTATTCC TACTAGCCCA AAAAATAGAA CTTCACATGA 

430 440 450 ' 460 470 480 

TGGCAACAGT TCATATAAGG TATACCACAC TGTAATGAAA AATGAAGGAC TATCTATAAT 

490 500 510 520 530 540 

ATCCATTATG GTCAATGGCT GCTATGATTT TTCTTCAGAT AATTTATCAA TATTACCTTA 

550 560 570 580 590 600 

TGTATGTGGT GGTATAGGTG TAAATGCTAT AGAGTTTTTC GATGCATTAC ATGTTAAATT 

610 620 630 640 650 660 

CGCGTGTCAG GGTAAATTAG GTATTACTTA TCCATTATCT TCCAACGTTA GTTTATTTGC 

670 680 690 700 710 720 

TGGTGGATAT TATCACCAAG TAATGGGCAA CCAATTTAAA AATCTAAATG TTCAACATGT 

730 740 750 760 770 780 

AGCTGAACTT AATGACGCAC CCAAAGTTAC ATCTGCAGTA GCTACACTTG ACAXTGGGTA 

790 800 810 820, 830 840 
TTTTGGTGGT GAAATTGGAG CAAGGCTTAT ATTTTAA. . . 

Fig. 18A 

10 20 30 40 50 60 

SSTKKQFGLY V5GQHQPSVS IFSHFSV KET NFPTKXSSSF LKKDXHSVEF DDSVTAGISY 
70 80 90 100 110 120 

PUIFSTFYIA VFQDNISNFN GAIGYTFVEG ERIEIEGSYE EFDVKDPGRY TEIQDAXHYF 

130 140 150 160 170 180 

ALARDIDSIP TSPKHRTSHD GNSSYKVYHT VMKKEGLSH SIMVNGCYDF SSDHLSILF* 

190 200 210 220 230 240 

VCGGIGVSAI EFFDALHVKF ACQGKLGITY PLSSHVSLFA GGYYHQVMCai QFKHLHVQHV 

250 260 270 280 290 300 
AELHDAPKVT SAVATU)IGY FGGEIGARLI F...v 

Fig. 18B 

10 20 30 40 50 60 

ATGAACTGCA AAAGATTTTT CATAGCAAGT GCATTGATAT CACTAATGTC TTTCTTACCT 
70 80 90 100 110 120 

AGCGTATCTT TTTCTGAATC AATACATGAA GATAAIAIAA ATGGTAACCT TTACATTAGT 

130 140 150 160 170 180 

GCAAAGTATA TGCCAAGTGC CTCACACTTT GGCOTAXTTT CAGTTAAAGA AGAGAAAAAC 

190 200 210 220 230 240 

ACAACAACTG GAGTTTTCGG ATTAAAAGAA GATTGGGACG GAGCAACAAT AAAGGATGCA 

250 260 270 280 290 300 

AGGAGCAGCC ACACAAIAGA CCCAAGTACA ATATTCTCCA TTTCAAATTA TTCATTTAAA 

310 320 330 340 350 360 

TATGAAAACA ATCCATTTTT AGGGTTTGCA GGAGCTATTG GCTACTCAAI GGGTGGTCCA 

370 380 390 400 410 420 

AGGGTAGAGT TTGAAGTGTC TTACGAAATA TTTGATGTAA AAAACCAAGG TAACAGTTAC 

430 440 450 460 470 480 

AAGAACGATG CTCACAAATA TTGCGCTTTA TCAAGACACA CCGGAGGTAT GCCACAAGCC 

490 500 510 520 530 540 

GGTCATCAAA ATAAATTTGT CTTCCTAAAA AATGAAGGAT TACTTGACAT AXCACTTATG 

550 560 570 580 590 600 

ATAAACGCAT GTTATGAIAT AACAATCGAC AGCATGCCAT TTTCTCCATA TATATGTGCA 

610 620 630 640 650 . 660 

GGTATTGGTA GTGACTTAGT TTCGATGTTT GAAACTACAA ATCCTAAAAT TTCTTATCAA 

670 680 690 700 710 720 

GGAAAATTAG GTGTAAGTTA CTCCATAAGC CCAGAAGCAT CTGTTTTTGT TGGAGGACAC 

730 740 750 760 770 780 

TTTCACAGAG TTATAGGTAA TGAATTTAAA GACATTCCTG CAATAACTCC TGCTGGAGCA 

790 - 800 810 820 830 840 

ACAGAAATTA AAGGCACACA GTTTACAACA GTAACATTAA ACA3ATGCCA CTTCGGACTA 

850 860 870 880 890 900 
GAGCTTGGAG GCAGGTTTAC TTTTTAA 

Fig. 19A 

10 20 30 40 50 60 

MNCKRSTIAS ALISLM3FLP SVSFSESIHE DNTHGNFYIS AKXMPSASHF GVFSVKEBKH 
70 80 90 100 110 120 

TTTGVFGLKQ DWDGATIKDA SSSHTIDPST JFSISNYSFK YEHHPFLGEA GAIGYSMGGP 

130 140 150 .160 170 180 

"KVEFEV5YEI FDVKNQGH3Y KNDAHKYCAL SRHTGGMPQA GHQNKFVFLK NEGLLDISIM 

190 200 210 220 230 240 

IHACYDITID SMPFSFYICA GIGSDLVSHF ETTOPKISYQ GKLGVSYSIS PEASVFVGGH 

250 260 270 280 290. 300 
FHRVTGHEFK DIPAITPAGA TEXKGTQFTT. VTTHICHFGL ELGGRFTF — . • • • 

Fig. 19B 
10 20 30 40 50 60 

ATGAAATATA AAAAAACTTT. TACAGTAACT GCATTAGTAT TATTAACTTC CTTTACACAT 
70 80 90 100. 110 120 

TTTATACCTT TTTM3U3TCG AGCACGTGCC AGTACAAXTC ACAACTTCTA CATTAGXGCT. 

130 140 150 160 170 180 

AAAXATATGC CAACAGCGTC ACATTTTGGA AXTTTTTCAGXTAAAGAAGA ACAAAGTTTT 

190 200 210 220 230 240 

ACTAAGGTAT TAGTTGGGTT AGATCAACGA TTATCACATA ATATEATAAA CAATAATGAT • 

250 260 270 280 290 300. 

ACAGCAAAGA GTCTTAAGGT TCAAAATTAT TCATTTAAAT ACAAAAAXAA CCCATTTCTA 

310 320 330 . 340 - 350 360 

GGATTTGCAA GAGCTAXTGG TTATTCAATA. GGCAATTCAA GAAXAGAACT AGAAGTATCA 

370 380 390 400 410 420 

CATGAAAXA7 TTGAIACTAA AAACCCAGGA AACAATTATT TAAATG&CTC TCACAAATAT 

430 440 450 460 470 480 

TGCGCTTTAT CTCATGGAAG TCACATATGC AGTGATGGAA AIAGCGGAGA TTGGTACACT 

490 500 510 520 530 540 

GCAAAAACXG AIAAGTTTGT ACTTCTGAAA- AATGAAGGTT TACTTGACGT CTCATTTATG 

550 560 .570 580 590 600 

TTAAACGCAX GTTATGACAT AACAACTGAA AAAATGCCTT TTTCACCTTA TAXATGTGCA 

610 620 630 640 650 660 

GCTATTGGXA CTGATCTCAT ATCTATGTTT GAGACAACAC AAAACAAAAT ATCTTATCAA 

670 680 690 - 700 710 720 

GGAAAGTTAG GTTTAAACTA TACTATAAAC TCAAGAGTTT CTGTTTTTGC AGGTGGGCAC 

730 740 750 760 770 780 

TTTCATAAAG TAATAGGTAA TGAAXTTAAA GGTATTCCTA CTCTATEACC TGATGGATCA 

790 800 810 820 830 840 

AACATXAAAG TACAACAGTC TGCAACAGTA ACATTAGATG TGTGCCATTT CGGGTTAGAG 

850 860 870 880 890 900 
ATTGGAAGTA GATTTTTCTT TTAA. •••••••• .......... • ••• 

Fig. 20A 

10 20. 30 40 50 .60 

MBDrtOCmVT ALVLLTSFTH FIPEYSPARA STZHHFY ISG KXMFTASHFG IFSAKEEQ3F 
70 80 90 100 110 . 120 

TKVI»VGLDQR "I*SHHIIHHHD TAKSLKVQHT SFKYKOTgEK GEMAIGXSI CTSRIELBV3 

130 140 . . 150 ' . 160 170 180 

HEXFDTKNPG NNYLBD3HKY CALSHG3HIC. 3D GH3G D CTYT AKTDKFVliLK HBGELOT3IM 

190 . - 200 . 210 220 * - * 230 . .240 

LNACY0ITTE KMPFSPYICA GIGTDLISME BTTQNKTSTQ GKLGLHYTIH SHVSVERGGH 

250 260 1 270 280 290 t 300 

FHKYIGHBFK GXETLLBVG3 HXKVQQSAXV TIiDVCHPGLE" IGSRFFF . . • 



Fig. 20B 
10 20 30 40 50 60 

ATGTTTTATA CTAAIATATA TATTCTGGCT TGTATTTACT TTGCACTTCC ACTATTGTTA 
70 80 90 100 110 120 

ATTrarrrrc actattttag gtgt&aiatg aattgcaaaa AAArrcnai aacaactgca. 

130 140 150 160 170 180 

ttraiaicax tartgtactc TarrccaftGc ATATcrrrrr. CTGaiaaai acaagatggt 

190 200 210 . 220 . 230 240 

AftCRTGGGTG GTAACTTCTA TATTAGTGGA AAGTATCTAC CAAGTGTCTC ACATTTTGGT 
- 250 260 270 280 290 300 

AGCTTCTCAG ctaaagaaga aagcaaatca actgttggag tttttggatt' aaaacatgat 

310 320 330 . 340 . 350 360 

TGGGATGGAA GICCaaXACT TAAGAATAAA CACGCTGACT .TTACTGITCC AAACTATTCG 

370 380 390 400 410' 420 

TTCAG&TACG AGAACAATCC ATTTCTAGGG TTTGCAGGAG CTATCGGTTR. CTCAAJGGGT 

430 440 450 460 470 '480 

GGCCCAAGAA TAGAATTCGA AATATCTTAT GAAGCATTCG ACGTAAAAAG TCCTAATATC 

490 500 510 520 530 540

AATTATCAAA ATGACGCGCA CAGGTACTGC GCTCTATCTC ATCACACATC GGCAGCCATG 

550 560 570 580 590 600 

GAAGCTGATA AATTTGTCTT CTTAAAAAAC GAAGGGTTAA TTGACATATC ACTTGCAATA

610 620 630 640 650 660

AATGCATGTT ATGATATAAT AAATGACAAA GTACCTGTTT CTCCTTATAT ATGCGCAGGT

670 680 690 700 710 720

ATTGGTACTG ATTTGATTTC TATGTTTGAA GCTACAAGTC CTAAAATTTC CTACCAAGGA

730 740 750 760 770 780 

AAACTGGGCA TTAGTTACTC TATTAATCCG GAAACCTCTG TTTTCATCGG TCGGCATTTC

790 800 810 820 830 840 

CACAGGATCA TAGGTAATGA GTTTAGAGAT ATTCCTGCAA TAGTACCTAG TAACTCAACT

850 860 870 880 890 900

ACAATAAGTG GACCACAATT TGCAACAGTA ACACTAAATG TGTGTCACTT TGGTTTAGAA

910 920 930 940 950 960 
CTTGGAGGAA GATTTAACTT CTAA 

Fig. 21A 

10 20 30 40 50 60

MFrnsiYiiA cgEALpiu. imnsBcm hckkilitta lisudsibs isfsdtiqdg 

70 80 90 100 110 120

HHOTimSG KZVBSVSBCG SFSAKEESKS TVGVEGLKHD WDGSPXLKHK HADFTVFHXS 

130 140 150 160 170 180 

FRYENNPFLG FAGAIGYSMG GPRIEFEISY EAFDVKSPNI NYQNDAHRYC ALSHHTSAAM

190 200 210 220 230 240

EADKFVFLKN EGLIDISLAI NACYDIINDK VPVSPYICAG IGTDLISMFE ATSPKISYQG

250 260 270 280 290 300

KLGISYSINP ETSVFIGGHF HRIIGNEFRD IPAIVPSNST TISGPQFATV TLNVCHFGLE

310 320 330 340 350 360 

LGGRFNF

Fig. 21B
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10 20 30 40 50 60 

ATGAATTGCA AAAAAATTCT TATAACAACT GCATTAATCT CATTAATGTA CTATGCTCCA 

70 80 90 100 110 120 

AGCATATCTT TTTCTGATAC TATACAAGAC GATAACACTG GTAGCTTCTA CATCAGTGGA 
130 140 150 160 170 180 

AAATATGTAC CAAGTGTTTC ACATTTTGGT GTTTTCTCAG CTAAAGAAGA AAGAAACTCA

190 200 210 220 230 240 

ACTGTTGGAG TTTTTGGATT AAAACATGAT TGGAATGGAG GTACAATATC TAACTCTTCT 

250 260 270 280 290 300

CCAGAAAATA TATTCACAGT TCAAAATTAT TCGTTTAAAT ACGAAAACAA CCCATTCTTA 

310 320 330 340 350 360

GGGTTTGCAG GAGCTATTGG TTATTCAATG GGTGGCCCAA GAATAGAACT TGAAGTTCTG 

370 380 390 400 410 420 

TACGAGACAT TCGATGTGAA AAATCAGAAC AATAATTATA AGAACGGCGC ACACAGATAC 

430 440 450 460 470 480 

TGTGCTTTAT CTCATCATAG TTCAGCAACA AACATGTCCT CCGCAAGTAA CAAATTTGTT 

490 500 510 520 530 540 

TTCTTAAAAA ATGAAGGGTT AATTGACTTA TCATTTATGA TAAATGCATG CTATGACATA 

550 560 570 580 590 600 

ATAATTGAAG GAATGCCTTT TTCACCTTAT ATTTGTGCAG GTGTTGGTAC TGATGTTGTT 

610 620 630 640 650 660 

TCCATGTTTG AAGCTATAAA TCCTAAAATT TCTTACCAAG GAAAACTAGG ATTAGGTTAT 

670 680 690 700 710 720 

AGTATAAGTT CAGAAGCCTC TGTTTTTATC GGTGGACACT TTCACAGAGT CATAGGTAAT

730 740 750 760 770 780

GAATTTAGAG ACATCCCTGC TATGGTTCCT AGTGGATCAA ATCTTCCAGA AAACCAATTT

790 800 810 820 830 840 

GCAATAGTAA CACTAAATGT GTGTCACTTT GGTTTAGAAC TTGGAGGAAG ATTTAACTTC 

850 860 870 880 890 900 
TGA

Fig. 22A

10 20 30 40 50 60 
MNCKKILITT ALISLMYYAP SISFSDTIQD DNTGSFYISG KYVPSVSHFG VFSAKEERNS
70 80 90 100 110 120
TVGVFGLKHD WNGGTISNSS PENIFTVQNY SFKYENNPFL GFAGAIGYSM GGPRIELEVL
130 140 150 160 170 180
YETFDVKNQN NNYKNGAHRY CALSHHSSAT NMSSASNKFV FLKNEGLIDL SFMINACYDI
190 200 210 220 230 240
IIEGMPFSPY ICAGVGTDVV SMFEAINPKI SYQGKLGLGY SISSEASVFI GGHFHRVIGN
250 260 270 280 290 300
EFRDIPAMVP SGSNLPENQF AIVTLNVCHF GLELGGRFNF

Fig. 22B



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WO 99/13720 PCT/US98A9600 

10 20 30 40 50 60 
ATGAATTGTA AAAAAGTTTT CACAATAAGT GCATTGATAT CATCCATATA CTTCCTACCT
70 80 90 100 110 120
AATGTCTCAT ACTCTAACCC AGTATATGGT AACAGTATGT ATGGTAATTT TTACATATCA
130 140 150 160 170 180
GGAAAGTACA TGCCAAGTGT TCCTCATTTT GGAATTTTTT CAGCTGAAGA AGAGAAAAAA
190 200 210 220 230 240
AAGACAACTG TAGTATATGG CTTAAAAGGA AAACTGGCAG GAGATGCAAT ATCTAGTCAA
250 260 270 280 290 300
AGTCCAGATG ATAATTTTAC CATTCGAAAT TACTCATTCA AGTATGCAAG CAACAAGTTT
310 320 330 340 350 360
TTAGGGTTTG CAGTAGCTAT TGGTTACTCG ATAGGCAGTC CAAGAATAGA AGTTGAGATG
370 380 390 400 410 420
TCTTATGAAG CATTTGATGT GAAAAATCCA GGTGATAATT ACAAAAACGG TGCTTACAGG
430 440 450 460 470 480
TATTGTGCTT TATCTCATGA AGATGATGCG GATGATGACA TGACTAGTGC AACTGACAAA
490 500 510 520 530 540 
TTTGTATATT TAATTAATGA AGGATTACTT AACATATCAT TTATGACAAA CATATGTTAT 
550 560 570 580 590 600 
GAAACAGCAA GCAAAAATAT ACCTCTCTCT CCTTACATAT GTGCAGGTAT TGGTACTGAT 
610 620 630 640 650 660 
TTAATTCACA TGTTTGAAAC TACACATCCT AAAATTTCTT ATCAAGGAAA GCTAGGGTTG 
670 680 690 700 710 720 
GCCTACTTCG TAAGTGCAGA GTCTTCGGTT TCTTTTGGTA TATATTTTCA TAAAATTATA
730 740 750 760 770 780 
AATAATAAGT TTAAAAATGT TCCAGCCATG GTACCTATTA ACTCAGACGA GATAGTAGGA 
790 800 810 820 830 840 
CCACAGTTTG CAACAGTAAC ATTAAATGTA TGCTACTTTG GATTAGAACT TGGATGTAGG 
850 860 870 880 890 900 
TTCAACTTCT AA



Fig.23A 

10 20 30 40 50 60

MNCKSVFTIS ALISSIYFLP NVSYSNPVYG NSMYGNFYIS GKYMPSVPHF GIFSAEEEKK
70 80 90 100 110 120

KTTVVYGLKG KLAGDAISSQ SPDDNFTIRN YSFKYASNKF LGFAVAIGYS IGSPRIEVEM

130 140 150 160 170 180

SYEAFDVKNP GDNYKNGAYR YCALSHQDDA DDDMTSATDK FVYLINEGLL NISFMTNICY

190 200 210 220 230 240

ETASKNIPLS PYICAGIGTD LIHMFETTHP KISYQGKLGL AYFVSAESSV SFGIYFHKII

250 260 270 280 290 300
RNKFKNVPAM VPINSDEIVG PQFATVTLNV CYFGLELGCR FNF

Fig.23B 
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10 20 30 40 50 60 

ATGAACTGTA AAAAATTTCT TATAACAACT ACATTGGTAT CACTAACAAT TCTTTTACCT

70 80 90 100 110 120

GGCATATCTT TCTCCAAACC AATACATGAA AACAATACTA CAGGAAACTT TTACATTATT
130 140 150 160 170 180

GGAAAATATG TACCAAGTAT TTCACATTTT GGGAACTCTT CAGCTAAAGA AGAAAAAAAC
190 200 210 220 230 240

ACAACTACTG GAfilTTTTGG ATTAAAAGAA TCATGGACTG ^"^^^Q 
250 260 270 280 290 300

GAACATGCAG CTTTTAATAT CCCAAATTAT TCATTTAAAT ATGAAAATAA TCCATTTTTA

310 320 330 340 350 360

GGATTTGCAG GGGTAATTGG CTATTCAATA GGTAGTCCAA GAATAGAATT TGAAGTATCA 

370 380 390 400 410 420

TACGAGACAT TCGATGTACA AAATCCAGGA GATAAGTTTA AOttlGAXGC ACATAAGTAT 
430 440 450 460 470 480

TGTGCTTTAT CCAATGATTC CAGTAAAACA ATGAAAAGTG GTAAATTCGT TTTTCTCAAA 
490 500 510 520 530 540

AATGAAGGAT TAAGTGACAT ATCACTCATG TTAAATGTAT GTTATGATAT AATAAACAAA
550 560 570 580 590 600

AGAATGCCTT TTTCACCTTA CATATGTGCA GGCATTGGTA CTGACTTAAT ATTCATGTTT
610 620 630 640 650 660 

GACGCXATAA ACCATAAAGC TGCTTATCAA GGAAAATTAG GTTTTAATTA TCCAATAAGC 

670 680 690 700 710 720

CCAGAAGCTA ACATTTCTAT GGGTGTGCAC TTTCACAAAG TAACAAACAA CGAGTTTAGA
730 740 750 760 770 780 

GTTCCTGTTC TATTAACTGC TGGAGGACTC GCTCCAGATA ATCTATTTGC AATAGTAAAG 

790 800 810 820 830
TTGAGTATAT GTCATTTTGG GTTAGAATTT GGGTACAGGG TCAGTTTTTA A 

Fig.24A 

10 20 30 40 50 60

MNCKKFLITT TLVSLTILLP GISFSKPIHE NNTTGNFYII GKYVPSISHF GNFSAKEEKN
70 80 90 100 110 120

TTTGIFGLKE SWTGGIIIJJK EHAAEHIPHY SFKYENHPEX GkAUVlGYSI GSPRIEFEVS 
130 140 150 160 170 180

YETFDVQNPG DKFNNDAHKY CALSNDSSKT MKSGKFVFLK NEGLSDISLM LNVCYDIINK

190 200 210 220 230 240

RMPFSPYICA GIGTDLIFMF DAINHKAAYQ GKLGFNYPIS PEANISMGVH FHKVTNNEFR
250 260 270 280 290 300
VPVLLTAGGL APDNLFAIVK LSICHFGLEF GYRVSF

Fig.24B
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10 20 30 40 50 60 
ATGAATJAATA AACTCAAATT TACTATAAIA AACACAGTAT TAGTATGCTT AITGTCATTA 
70 80 90 100 110 120 
CGTAATATAT CTTCCTCAAA GGCCATAAAC AATAACGCTA AAAAGTACTA CGGATTATAT 
130 140 150 160 170 180
ATCAGTGGAC AATATAAACC CAGTGTTTCT GTTTTCAGTA AATTTTCAGT TAAAGAAACC
190 200 210 220 230 240 
AATGTCATAA CTAAAAACGT TATAGCTTTA AAAAAAGATG TTGACTCTAT TGAAACCAAG 
250 260 270 280 290 300 
ACTGATGCCA GTGTAGGTAT TAGTAACCCA TCAAATTTTA CTATCCCCTA TACAGCTGTA 
310 320 330 . 340 350 360 
TTTCAAGATA ATTCTGTCAA TTTCAATGGA ACTATTGGTT ACACCTTTGC TGAAGGTACA 
370 380 390 400 410 420 
AGAGTTGAAA TAGAAGGTTC TTATGAGGAA TTTGATGTTA AAAACCCTGG AGGCTATACA
430 440 450 460 470 480
CTAAGTGATG CCTATCGCTA TTTTGCATTA GCACGTGAAA TGAAAGGTAA TAGTTTTACA 
490 500 510 520 530 540 
CCTAAAGAAA AAGTTTCTAA TAGTATTTTT GACACTGTAA TGAGAAATGA TGGATTATCT 
550 560 570 580 590 600 
ATAATATCTG TTATAGTAAA TGTTTGCTAC GATTTCTCTT TGAACAATTT GTCAATATCG 
610 620 630 640 650 660
CCTTACATAT GTGGAGGAGC AGGGGTAGAT GCTATAGAAT TCTTCGATGT ATTACACATT 
670 680 690 700 710 720 
AAGTTTGCAT ATCAAAGCAA GCTAGGTATT GCTTATTCTC TAGCATCTAA CATTAGTCTC 
730 740 750 760 770 780
TTTGCTAGTT TATATTACCA TAAAGTAATG GGCAATCAAT TTAAAAATTT AAATGTCCAA
790 800 810 820 830 840 
CATGTTGCTG AACTTGCAAG TATACCTAAA ATTACATCCG CAGTTGCTAC ACTTAATATT 
850 860 870 880 890 900 
GGTTATTTTG GAGGTGAAAT TGGTGCAAGA TTGACATTTT AA 

Fig.25A 

10 20 30 40 50 60 

MKHKLKFTH NTVLVCLLSI. PNIS5SKAIN N MAKK XY GLY ISGQYKPSVS VFSHFSVKET 
70 80 90 100 110 120

NVXTRHLXAL KKDVDS1KTK TDASVGISNP SNFTIFTEAV FQBBSVOTNG TIGYTFAEGT 

130 140 150 160 170 180

RVEIEGSYEE FDVKNPGGOT LSDAYRYFAL AREMKGHSFT P K EKVSH SIF HTVMRHDGL3 

190 200 210 220 230 240 

nSVIVHVCY DFSLNNLSIS PYICGGAGVD AIEFFDVLHT KFAYQSKLGI AYSLPSHISL 

250 260 270 280 290 300 

FASLYYHKVM GNQFKNLNVQ HVAELASIPK ITSAVATLNI GYFGGEIGAR LTF

Fig.25B






10 20 30 40 50 60
ATGGCAAATT TTATGTACAA AAAATACAAA CTAATGACAG CAGGTGTAGT ATTATTTCAC
70 80 90 100 110 120
ATGTTATTTC TACCTCATGT TTCTTTCGCA AAAAATACAA ACAGCAATAA ACTTGGATTA
130 140 150 160 170 180
TACATCAGTG GACAGTATAA CCCTAGTGTT TCTGTTTTTA GCAATTTTTC AGCAAAAGAA
190 200 210 220 230 240
ACGAATGTTC ATACAGTACA ACTCATGGCG CTTAAAAAAG ACATTGATTC TATTCAAGTT
250 260 270 280 290 300
GATACTGGAA ATAGCGCAGG TATTAGCAAA CCACAAAATT TCACAGTTCT TTATACTCCA
310 320 330 340 350 360
AAATTTCAAG ATAATGTTGC TGGTCTTAGC GGTGCACTTG GATTCTTTTA TTCTAAAGGA
370 380 390 400 410 420
TTAAGGATTG AAATGGGGTT TTCTTATGAA AAATTTGATG CTAAAGACCT TGGTGAGTAC
430 440 450 460 470 480
ACCAAAATAA AAGATGCTTA TAGATATTTT GCTCTAGTAC GTGAAATGCA TGTTAGTCTC
490 500 510 520 530 540
ATTTATCCAA AAGATAATAA CACAGGAACA CATTATACTG TTATGAGAAA TGATGGTATA
550 560 570 580 590 600
TCTATTTCTT CTGCTACAGT AAATGGCTGC TATGATTCTT TTTTCCAGTT TATCTTTGTC
610 620 630 640 650 660
ACCTATATGT GTATAGGCAT CGGTATAGAT GCTATAGAAT TTCTTAATGC ATACATATTA
670 680 690 700 710 720
AGTTTGCTTG CCAAGGTAGT TAAGGTGTTA ACTTATTCTG TATCTCCCAA TGTTAATTTA
730 740 750 760 770 780
TTTGCAGATG GATATTATCA TAAAGTGATG GGCAATAAAT TTAAAAATTT ACCTGTTGAA
790 800 810 820 830 840
TACGTTAATA CTTTAGAAGA GTATCCAAGA GTTACATCTG CAATTGCTAC ACTTGATATT
850 860 870 880 890 900
GGCTACCTCG GTGGTGAAAT TGGCATAAGA

Fig.26A

10 20 30 40 50 60


MYKKYKLMTA GVVLFHMLFL PHVSFAKNTN SNKLGLYISG QYNPSVSVFS NFSAKETNVH
70 80 90 100 110 120

TVQLMALKKD IDSIEVDTGN SAGISKPQNF TVLYTPKFQD NVAGLSGALG FFYSKGLRIE

130 140 150 160 170 180

MGFSYEKFDA KDLGEYTKIK DAYRYFALVR EMHVSLIYPK DNNTGTHYTV MRNDGISISS

190 200 210 220 230 240

ATVNGCYDSF FQFIFVTYMC IGIGIDAIEF LNAYILSLLA KVVKVLTYSV SPNVNLFADG

250 260 270 280 290 300
YYHKVMGNKF KNLPVQYVNT LEEYPRVTSA IATLDIGYLG GEIGIRFIF

Fig.26B 
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10 20 30 40 50 60 

ATGGGAAAXT CTATGAATAA TAAAAGTCAA TTCTTAATAA GATTTATATT TTTAACATGC

70 80 90 100 110 120

ATGCTGTCAT TACCTAATAT ATCTCTTTCA AAAGTAAATA ACGAAAAAGA TTCTGGTTTG
130 140 150 160 170 180

TATATTAGCG GGGAATACAA ACCCAGTGTT TCTGTTTTCA GTAATTTTTC AGTTAAAGAA
190 200 210 220 230 240

ACCAACTTTC ATACAAAAGA TCTCATAGCT CTTAAACAAG ATGTTGATTC TGTTGAAATT

250 260 270 280 290 300

GATACTGGTA GTAATACAGC AGGTATTAGT AACCCATCTA ACTTTACAAT CCCTTATACT

310 320 330 340 350 360 

GCAGAATTTC AAGACAACCA TACTAACTGC AATGGCTCTA TTGGTTATGC TTTTGCTGAA 

370 380 390 400 410 420 

GGTCCAAGAA TTGAAATAGA ATTATCATAT GAAAAATTTG ATGTTAAAAA TCCCACAGGG 

430 440 450 460 470 480

TATACTACAG TAAAAGATGC TTATAGATAC TTTGCTTTAG CACGTGAAAT AAA3JAXTTCT

490 500 510 520 530 540

CTATTCCAAC CAAAACAAAA AGAAGGTAGT GGAATTTACC ATGTCGTAAT GAAAAACGAT

550 560 570 580 590 600

GGGTTATCTA TCTTATCCAA TATAGTTAAT ATTTGCTACG ATTTTTCTTT AAATAATTTA

610 620 630 640 650 660

CCTATATCAC CTTATTTATG CGGAGGAATG GGTATAAATG CCATAGAATT CTTTGACGCT

670 680 690 700 710 720 

TTACATGTGA AATTTGCTTA TCAAAGCAAG GCAGGAATTA GTTATCAACT ATTACGTAAA 

730 740 750 760 770 780 

ATCAACTTAT TTATTGATGT ATATTACTAC GAAGTAATAA GTAATAAATT TAAAAACCTG

790 800 810 820 830 840

AAAGTCCAAC ATGTACATGA ACTTAAAGAT AATCCAAAAG TCACATCTGC AGTTGCTACA

850 860 870 880 890 900
CTTGATATAG CATATTTTGG TAGTGAAGCT GGCATAAGAA TTATATTTTA A

Fig.27A

10 20 30 40 50 60

MNNKSQFLIR FIFLTCKLSL PNISLSKVNN EKHSGLYX3G QYKPSVSVFS NFSVKETHFH 
70 80 90 100 110 120

TKHLTALKQD VDSVEIDTGS HTAGXSNPSN FTIPYTAEFQ DHHTKCHGSI GYAFAEGPRI 

130 140 150 160 170 180

3XELS2EKFD VKHPTGYTTV KDAYRYFALA REUEtSLFQP KQKEGSGIYH WHKNDGLSX 

190 200 210 220 230 240 

LSHIVNICTD FSLBHLPISP YLCGGHG1NA IEFFDALHVK EAYQSKAGIS YQLLRKIHLF 

250 260 270 280 290 300
IDVYYYEVIS NKFKNLKVQH VHELKDNPKV TSAVATLDIA YFGSEAGIRI IF

Fig. 27B 
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AXGAAIAGCA AGAGTAAGTT CTTTACAATA TGTACATCGT TAATATGCTT ATTATCATCA 

70 80 90 100 110 120 

CCTAACACAT CTCTCTCAAA CTTCATAGGC AATAGTACAA AACATTCTGG ATTATATGTT
130 140 150 160 170 180

AGCGGACAAT ATAAGCCGAG CGTTTCCATT TTTAGCAAAT TTTCAGTAAA AGAAACAAAT

190 200 210 220 230 240 

ACACATACAG TACAGTTAGT AGCTCTTAAA AAAGATGTTA ATTCTATTTC TATGAACATC 

250 260 270 280 290 300 

AGTAATGGTG CTACAGGCAT TAGCAAAGCA ACAAATTTTA ATCTTCCTTA TGTTGCAGAA

310 320 330 340 350 360

TTTCAAGACA ATGCCTTCAA CTTCAGTGGA GCTATTGGTT ATTCACTTTT TGAACAACTA 

370 380 390 400 410 420 

AACATTGAAG TTGAAGGTTC TTATGAAGAA TTCGATGCCA AAAATCCTGG TGGTTATATT

430 440 450 460 470 480 

TTAAATGATG CATTCCGCTA TTTTGCATTG GCACGTGAAA TGGGACAAGA AAAAAATGAT 

490 500 510 520 530 540 

AATAAGCATC TTAGTCCTAA GGAGGAGCAT GATATAAGTA AAACATATTA CACAGTCATG

550 560 570 580 590 600

AGAAATAATG GGTTATCTAT ATTATCTATT ATGATAAATG GCTGCTATAA TCTACCTCTC

610 620 630 640 650 660

AATGATTTAT CAATATCACC TTATTTTTGT ACAGGAATAG GTGTAGATGC TATAGAATTT

670 680 690 700 710 720

TTTGATGCAC TGCATCTTAA ACTTGCTTTG CAAAGTAAAA TAGGAGCTAC TTACCAATTA

730 740 750 760 770 780 

TCAGACAACA TTAGTTTATT TACAAATGGA TATTACCATC AAGTAATAGG TGATCAATTT 

790 800 810 820 830 840

AAAAACTTAA AAGTCCAATA TATAGGTGAA CTTAAAGAGA ACCCGAAAAT TACATCTGCA

850 860 870 880 890 900 

GTTGCTACTC TCAATGTTGG ATACTTTGGA GGXGAAATTG GAGTAAGACT CACACTTTAA 

910 920 930 940 950 960 



Fig.28A 
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MHSKSKFFTI CTSLlCLLSS FHTSLSHFIG NSTKHSGLYV SGQYKPSVSI FSKFSVKETM 
70 80 90 100 110 120

THTVQLVALK KDVNSISMNI SNGATGISKA TNFNLPYVAE FQDNAFNFSG AIGYSLFEQL

130 140 150 160 170 180

NIEVEGSYEE FDAKNPGGYI LNDAFRYFAL AREMGQEKND NKHLSPKEEH DISKTYYTVM

190 200 210 220 230 240

RNNGLSILSI MINGCYNLPL NDLSISPYFC TGIGVDAIEF FDALHLKLAL QSKIGATYQL

250 260 270 280 290 300

SDNISLFTNG YYHQVIGDQF KRLKVQYIGE LKENPKITSA VATLNVGYFG GEIGVRLTL

Fig.28B 
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AAGCTTCTTA TGAAGAATTT GACGTTAAAA ATCCTGAAGG ATCTACTACA GACTCCTATA 
70 80 90 100 110 120 

GATATTTCGC GTTAGCACGT GGCATGGATG GTAATAATAT TCCTACAAGT CAAAAATTTA 

130 140 150 160 170 180

CTGTAATGAG AAACGACGGG TTATTAATCT CATCTGTTAT GATAAATGGC TGTTACAATG 

190 200 210 220 230 240 

TCATACTAAA TGATATACAA GCAGAACCTT ACATATGTGC AGGACTAGGA GGAGATTTTA 

250 260 270 280 290 300 

TAGAATTCTT CAATGGCTTT CATGTTAAGC TAGCTTATCA AGGTAAAGTA GGCATTAGTT 

310 320 330 340 350 360

ATCAAATATT CCCTGAAGTA AGATTATTTA TTGATGGATA CTACCATAAA GTAAAAGGCA

370 380 390 400 410 420

ACAAGTTTAA AAATTTACAC GTTCAACATG TAGGTGCACT TGCAGCACTC CCTAAAGTTA

430 440 450 460 470 480

CATCTGCAGT TGCAACACTT AATATTGGAT ACTTTGGTTG TGAAGCTGGA GTAAGATTCA 

490 500 510 520 530 540 
TATTTTAA

Fig. 29A
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ASYEEFDVKN PEC25TTDSYR YEAIARGMDG NNIPTSQKFT VMRNDGLLXS SVMINGCYNV 
70 80 90 100 110 120
ILNDIQAEPY ICAGLGGDFI EFFNGFHVKL AYQGKVGISY QIFPEVRLFI DGYYHKVKGN
130 140 150 160 170 180
KFKNLHVQHV GALAALPKVT SAVATLNIGY FGCEAGVRFI F 

Fig.29B 
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ATGAATTATA AGAAAATTCT AGTAAGAAGC GCGTTAATCT CATEAATGTC AATCTTACCA 

70 80 90 100 110 120

TATCAGTCTT TTGCAGATCC TGTAGGTTCA AGAACTAATG ATAACAAAGA AGGCTTCTAC

130 140 150 160 170 180

ATTAGTGCAA AGTACAATCC AAGTATATCA CACTTTAGAA AATTCTCTGC TGAAGAAACT

190 200 210 220 230 240

CCTATTAATG GAACAAATTC TCTCACTAAA AAAGTTTTCG (SVCTAAAGftA AGATGGTGAX 

250 260 . 270 280 290 300 

ATAACAAAAA AAGACGATTT TACAAGAGTA GCTCCAGGCA TTGATTTTCA AAATAACTTA
310 320 330 340 350 360

ATATCAGGAT TTTCAGGAAG TATTGGTTAC TCTATGGACG GACCAAGAAT AGAACTTGAA 

370 380 390 400 410 420 

GCXGCA33VTC ACAATTTBAT CCAAAAACAC GATAACAATG* ATACTGATAA TGGTGAATAC 

430 440 450 460 470 480

TATAAACATT TTGCATATCT CGTAAAGATG CCATGGAAGA TCAGCCATAT GTTGTTCTTA

490 500 510 520 530 540
AAAATGACGG CATAC

Fig.30A 
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HtenmS ALISIiMSH^* yqsfadpvgs rtndhkegw isakynpsis hfrkfsa^t 



10 20 30 40 50 60 

GS 

70 80 90 100 110 120

PIKGTNSLTK KVFGLKKDGD ITKKDDFTRV APGIDFQNNL ISGFSGSIGY SMDGFRIELE 

130 140 150 160 170 
AAYHHLIQKH DNHDTDNGEY YKHFAYLVKM PWKISHMLFL KMTAY 



Fig.30B 
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